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METHOD FOR PREPARING POLYFXJNCTIONALIZED PEPTIDES 
AND/OR PROTEINS VIA NATIVE CHEMICAL LIGATION 


Priority Claim 

5 [0001] This application claims priority to provisional application Nos.: 

60/500,708, filed September 5, 2003; 60/560,147, filed April 7, 2004; each of the 
above-cited Applications is hereby incorporated by reference in its entirety. 


Government Support 
10 [0002] The invention was supported in part by Grant Nos.: CA103823 

(formerly All 6943), T32-CA62948 and AI051883 from the National Institutes of 
Health. The U.S. government may have certain rights in this invention. 


Background of the Invention 

15 [0003] Glycoprotems are important biomacromolecules that are 

biosynthesized through posttranslational glycosylation of newly fashioned protems 
emergmg from the ribosome. Interest in glycoproteins arises at many levels. A 
long-term goal of the growing field of chemistry-based glycobiology is the 
delineation of the consequences of glycosylation on critical properties such as 

20 protein folding, proteolytic stability, and cell adhesion.^ Such insights could explain 
why nature bothers to glycosylate otherwise functional proteins. Moreover, 
glycoproteins have potentially important clinical roles in the context of vaccines, 
diagnostics, and therapeutics. Indeed, erythropoietin, albeit a heterogeneous 
glycoprotein,^ is clinically valuable as a treatment for anemia, among other 

25 indications."^^ 

[0004] Many naturally occurring, medicinally important glycoproteins (c/, 

for example, erythropoietin^^ and gpl20^^) display multiple glycosylation sites 
containing large oligosaccharide domains. However, given the complexity and 
variability of biological glycosylation pathways, the isolation of homogeneous 
30 glycoproteins from natural sources in significant quantity is extremely difficuh. 

[0005] Numerous methods exist for the production of glycopeptides by 

chemical synthesis. For example, glycans have been introduced into peptides via 
amino acid "cassettes" with pendant protected saccharides,^^"^* through enzymatic 
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manipulations of glycopeptides,^"* or by conjugation of fully elaborated, complex 
saccharides to short synthetic peptides.^^"^* Larger 0-linked glycopeptides have been 
synthesized using ligation techniques'^^''*^ such as expressed protein ligation. 
Bertozzi and coworkers extended the scope of the "cassette" approach by applying 

5 native chemical ligation to the synthesis of a biologically active glycoprotein with 
two single-residue 0-linked glycans.'*'^ Tolbert and Wong described the ligation of a 
392-residue intein-generated peptide thioester and a dipeptide functionalized with a 
single A^-acetylglucosamine residue. However, none of these approaches has 
allowed the assembly of complex glypeptides or glycoproteins multiply 

10 functionalized (e.g. , multiply glycosylated) at designated sites. 

[0006] Accordingly, there remains a need for novel synthetic methods 

leading to the preparation of homogeneous polyfunctionalized peptides and/or 
proteins. Specifically, convergent, stereoselective, versatile methods for preparing 
such glycopeptides and/or glycoproteins are needed. 

15 

Summary of the Invention 
[0007] In recognition of the need to provide access to polyfunctionalized 

peptides and proteins, the present invention, in one aspect, provides a system for 
preparing a polyfunctionalized peptide comprising a peptidic backbone made up of 
20 four or more amino acids, wherein two or more non-adjacent amino acids are 
independently substituted with a moiety having the structure: 

with the proviso that the peptide sequence between any two consecutive, 
non-adjacent, amino acids bearing a A-L*- moiety comprises at least one cysteine 
25 residue; 

wherein the method comprises a step of: 

reacting a peptide acyl donor comprising a peptidic backbone made up of 
two or more amino acids wherein said peptide acyl donor has the structure: 
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/ I Peptide Backbone qf^o 
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with a peptide amine acceptor having the structure: 



' Peptide Backbone k rX2 


Y 


o 

under suitable conditions to effect ligation; 
5 wherein kl and k2 are independently integers between 1 and about 20; 

each occurrence of A, Ai and A2 is independently an aliphatic, 
heteroaliphatic, aromatic, heteroaromatic, aryl, heteroaryl or a pharmaceutically 
useful group or entity; 

R^^ is a sulfide protecting group; 
10 R^^ is a group such that the moiety -C(=0)OR^^ can be made to undergo 

ligation with the peptide amine acceptor; 

each occurrence of is independently a substituted or unsubstituted, linear 
or branched, cyclic or acyclic, saturated or unsaturated aliphatic or heteroaliphatic 
moiety; 

15 R^* is hydrogen, alkyl, acyl, aromatic, heteroaromatic, aryl, heteroaryl, - 

alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid; 

R^ is -OR^" or -NR^^R^', wherein R^^ is hydrogen, alkyl, aromatic, 
heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a carboxylic acid 

20 protecting group, an amino acid or a proctected amino acid; and R and R are 
independently hydrogen, alkyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid. 
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15 


20 


[0008] In certain embodiments, each occurrence of A is independently a 

phannaceutically useful group or entity. In certain embodiments, each occurrence of 
A is independently a biomolecule, a small molecule, a macromolecule or a 
diagnostic label. 

[0009] In certain exemplary embodiments, each occurrence of A is 

independently a carbohydrate determinant having the structure: 

L Jy R3 

wherein a, b, c, d, e, f, g, h, i, x, y and z are independently 0, 1, 2 or 3, with 
the proviso that the x, y and z bracketed structures represent furanose or pyranose 
moieties and the sum of b and c is 1 or 2, the sum of d and f is 1 or 2, and the sum of 
g and i is 1 or 2, and with the proviso that x, y and z are not simultaneously 0; 
wherein Ro is hydrogen, a Imear or branched chain alkyl, acyl, arylalkyl or aryl 
group; wherein each occurrence of Ri, R2, R3, R4, R5> Re > R?, Rs and R9 is 
independently hydrogen, OH, OR', NHR^ NHCOR\ F, CH2OH, CHzOR^ a 
substituted or unsubstituted , linear or branched chain alkyl, (mono-, di- or 
tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R' is independently hydrogen, CHO, COOR*^ or a substituted or 
unsubstituted linear or branched chain alkyl, acyl, arylalkyl or aryl group or a 
saccharide moiety having the structure: 

R12 




wherein Y and Z are independently NH or 0; wherein k, 1, r, s, t, u, v and w 
are each independently 0, 1 or 2; with the proviso that the v and w bracketed 
structures represent furanose or pyranose moieties and the sum of 1 and k is 1 or 2, 
and the sum of s and u is 1 or 2, and with the proviso that v and w are not 
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simultaneously 0; wherein R'o is hydrogen, a linear or branched chain alkyl, acyl, 
arylalkyl or aryl group; wherein each occurrence of Rjo, Rn, R12, Ris, Ru and R15 is 
independently hydrogen, OH, OR*", NHR"^ NHCOR'", F, CH2OH, CH20R"\ or a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 

5 tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R16 is hydrogen, COOH, COOR", CONHR", a substituted or 
unsubstituted linear or branched chain alkyl or aryl group; wherein each occurrence 
of R"' is hydrogen, CHO, COOR'", or a substituted or unsubstituted Imear or 
branched chain alkyl, acyl, arylalkyl or aryl group; and wherein each occurrence of 

10 R" and R''' are each independently H, or a substituted or unsubstituted linear or 
branched chain alkyl, arylalkyl or aryl group. 

[0010] In certain embodiments, the present invention provides a method for 

preparing a polyfimctionalized peptide having the structure: 



15 wherein si and s2 are independently an integer from 1 to about 20; 

tl, t2 and t3 are each independently an integer; 

R'^^ is hydrogen, alkyl, acyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid; 

20 R^ is -OR^^ or -NR^^^^ wherein R^" is hydrogen, alkyl, aromatic, 

heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a carboxylic acid 

^Qb X2c 

protecting group, an amino acid or a proctected amino acid; and R and R are 
independently hydrogen, alkyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
25 proctected amino acid; 
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bF\ and R^^ are independently H, alkyl, heteroalkyl, aromatic, 
heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), or a natural or non- 
natural amino acid side chain; 

each occurrence of is independently a substituted or unsubstituted 
aliphatic or heteroaliphatic moiety; 

Ai and A2 are each independently an. aliphatic, heteroaliphatic, aromatic, 
heteroaromatic, aryl, heteroaryl or a pharmaceatically useful group or entity; and 

at least one occurrence of the bracketed structure t2 is a cysteine residue or 
protected cysteine residue; 

wherein the method comprise a step of: 

reacting a peptide acyl donor having the structure: 



with a peptide amine acceptor having the structure: 



15 under suitable conditions to effect ligation; 

wherein the sum t+t' equals (t2)+l; R^^ is a sulfide protecting group; and R^° 
is a group such that the moiety -C(=0)OR^^ can be made to undergo ligation with 
the glycopeptide amine acceptor. 

[0011] In certain exemplary embodiments, Aj and A2 are each independently 

20 a carbohydrate domain as defined for A above- 
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[0012] In certain embodiments, the step may be cairied out once, or repeated 

a desired number of times, to prepare the polyfunctionalized peptide having the 
structure: 



5 wherein R and R are as defined above; 

each occurrence of A may be the same or different and may be as defined for 
Ai and A2 above; 

each occurrence of R^^ may be the same or different and may be as defined 

forR^^ and above; 
10 q is an integer greater than or equal to 2; 

each occurrence of s is independently an integer from 1 to about 20; 
each occurrence of t is independently an integer; 
to is an integer; and 

each occurrence of R^° is independently H, alkyl, heteroalkyl, aromatic, 
15 heteroaromatic, aryl, heteroaryl -alkyl(aryl), -alkyl(heteroaryl), or a natural or non- 
natural amino acid side chain. 

Definitions 

[0013] Certam specific functional groups defined in the inventive method are 

20 described in more detail below. For purposes of this invention, the chemical 
elements are identified in accordance with the Periodic Table of the Elements, CAS 
version, Handbook of Chemistry and Physics, 75* Ed., inside cover, and specific 
functional groups are defined as described therein. Additionally, general principles 
of organic chemistry, as well as specific functional moieties and reactivity, are 
25 described m "Organic Chemistry", Thomas Sorrell, University Science Books, 
Sausalito: 1999, the entire contents of which are incorporated herein by reference. 
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[0014] It will be appreciated that the compounds, as described herein, may 

be substituted with any number of substituents or functional moieties. In general, 
the term "substituted" whether preceded by the term "optionally" or not, and 
substituents contained in formulas of this invention, refer to the replacement of 

5 hydrogen radicals in a given structure with, the radical of a specified substituent 
When more than one position in any given structure may be substituted with more 
than one substituent selected from a specified group, the substituent may be either 
the same or different at every position. As used herein, the term "substituted" is 
contemplated to include all pemiissible substituents of organic compounds. In a 

10 broad aspect, the permissible substituents include acyclic and cyclic, branched and 
unbranched, carbocyclic and heterocyclic, airomatic and non-aromatic, carbon and 
heteroatom substituents of organic compounds. For purposes of this invention, 
heteroatoms such as nitrogen may have hydrogen substituents and/or any 
permissible substituents of organic compounds described herein which satisfy the 

15 valencies of the heteroatoms. Furthermore, this invention is not intended to be 
limited in any maimer by the permissible substituents of organic compoimds. 
Combinations of substituents and variables envisioned by this invention are 
preferably those that result in the formation of stable compounds useful in the 
treatment and prevention, for example of disorders, as described generally above. 

20 Examples of substituents include, but are not limited to aliphatic; heteroaliphatic; 
alicyclic; heteroalicyclic; aromatic, heteroaromatic; aryl; heteroaryl; alkylaryl; 
alkylheteroaryl; alkoxy; aryloxy; heteroalkoxy; heteroaryloxy; alkylthio; arylthio; 
heteroalkylthio; heteroarylthio; F; CI; Br; I; -NO2; -CN; -CF3; -CH2CF3; -CHCh; - 
CH2OH; -CH2CH2OH; -CH2NH2; -CH2SO2CH3; - or -GR^^ wherem G is -0-, -S-, - 

25 mP\ -c(-o)-, -seoH -SO2-, -c(=o)o-, -c(=o)nr^^-, -oc(=o)-, -nr^^c(=0)-, 

-0C(=0)0-, -OC(=0)NR^-, -NR^C(=O)0-, -NR^^C(=0)NR^^-, -C(=S)-, - 
C(=S)S-, -SC(=S)-, -SC(=S)S-, -C(-NR^)-, -C(=NR°^)0-, -C(=NR^^)NR^-, - 
OC(=NR^^)-, -NR^C(=NR^)-, -NR^SOz-, -NR^^SOjNR^^-, or -S02NR^^-, 
wherein each occurrence of and R^^ independently includes, but is not 

30 limited to, hydrogen, halogen, or an optionally substituted aliphatic, heteroaliphatic, 
alicyclic, heteroalicyclic, aromatic, heteroaromatic, aryl, heteroaryl, alkylaryl, or 
alkylheteroaryl moiety. Additional examples of generally applicable substituents are 
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illustrated by the specific embodiments shown in the Examples that are described 
herein. 

[0015] The term "stable", as used herein, preferably refers to compounds 

which possess stability sufficient to allow manufacture and which maintain the 
5 integrity of the compound for a sufficient period of time to be useful for the 
purposes detailed herein. 

[0016] The term "aliphatic", as used herein, includes both saturated and 

unsaturated, straight chain (z.e,, unbranched) or branched aliphatic hydrocarbons, 
which are optionally substituted with one or more functional groups. As will be 
10 appreciated by one of ordinary skill in the art, "aliphatic" is intended herein to 
include, but is not limited to, alkyl, alkenyU alkynyl moieties. Thus, as used herein, 
the term "alkyl" includes straight and branched alkyl groups. An analogous 
convention applies to other generic terms such as "alkenyl", "alkynyl" and the like. 
Furthermore, as used herein, the terms "alkyl", "alkenyl", "alkynyl" and the like 
15 encompass both substituted and unsubstituted groups. In certain embodiments, as 
used herein, "lower alkyl" is used to indicate those alkyl groups (substituted, 
unsubstituted, branched or imbranched) having about 1-6 carbon atoms. 
[0017] In certain embodiments, the alkyl, alkenyl and alkynyl groups 

employed in the invention contain about 1-20 aliphatic carbon atoms. In certain 
20 other embodiments, the alkyl, alkenyl, and alkynyl groups employed in the invention 
contain about 1-10 aliphatic carbon atoms. In yet other embodunents, the allcyl, 
alkenyl, and alkynyl groups employed in the invention contain about 1-8 aliphatic 
carbon atoms. In still other embodiments, the alkyl, alkenyl, and alkynyl groups 
employed in the invention contain about 1-6 aliphatic carbon atoms. In yet other 
25 embodiments, the alkyl, alkenyl, and alkynyl groups employed in the invention 
contain about 1-4 carbon atoms. Illustrative aliphatic groups thus include, but are 
not limited to, for example, methyl, ethyl, n-propyl, isopropyl, allyl, n-butyl, sec- 
butyl, isobutyl, tert-butyl, n-pentyl, sec-pentyl, isopentyl, tert-pentyl, n-hexyl, sec- 
hexyl, moieties and the like, which again, may bear one or more substituents. 
30 Alkenyl groups include, but are not limited to, for example, ethenyl, propenyl, 
butenyl, l-methyl-2-buten-l-yl, and the like. Representative alkynyl groups include, 
but are not limited to, ethynyl, 2-propynyl (propargyl), 1-propynyl and the like. 
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[0018] The term "alicyclic", as used herein, refers to compounds which 

combine the properties of aliphatic and cyclic compounds and include but are not 
limited to cyclic, or polycyclic aliphatic hydrocarbons and bridged cycloalkyl 
compounds, which are optionally substituted with one or more functional groups. 

5 As will be appreciated by one of ordinary skill in the art, "alicyclic" is intended 
herein to include, but is not limited to, cycloalkyl, cycloalkenyl, and cycloalkynyl 
moieties, which are optionally substituted with one or more functional groups. 
Illustrative alicyclic groups thus include, but are not limited to, for example, 
cyclopropyl, -CH2-cyclopropyl, cyclobutyl, -CHa-cyclobutyl, cyclopentyl, -CH2- 

10 cyclopentyl-n, cyclohexyl, -CH2-cyclohexyl, cyclohexenylethyl, cyclohexanylethyl, 
norborbyl moieties and the like, which again, may bear one or more substituents. 
[0019] The term "cycloalkyl", as used herein, refers specifically to groups 

having three to seven, preferably three to ten carbon atoms. Suitable cycloalkyls 
include, but are not limited to cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, 

15 cycloheptyl and the like, which, as in the case of aliphatic, heteroaliphatic or 
heterocyclic moieties, may optionally be substituted. An analogous convention 
applies to other generic terms such as "cycloalkenyl", "cycloalkynyl" and the like. 
[0020] The term "heteroaliphatic", as used herein, refers to aliphatic moieties 

m which one or more carbon atoms in the main chain have been substituted with a 

20 heteroatom. Thus, a heteroaliphatic group refers to an aliphatic chain which 
contains one or more oxygen, sulfur, nitrogen, phosphorus or silicon atoms, e.g., in 
place of carbon atoms. Heteroaliphatic moieties may be branched or linear 
unbranched. In certain embodiments, heteroaliphatic moieties are substituted by 
independent replacement of one or more of the hydrogen atoms thereon with one or 

25 more moieties including, but not limited to aliphatic; heteroaliphatic; alicyclic; 
heteroalicyclic; aromatic, heteroaromatic; aryl; heteroaryl; alkylaryl; alkylheteroaryl; 
alkoxy; aryloxy; heteroalkoxy; heteroaryloxy; alkylthio; arylthio; heteroalkylthio; 
heteroarylthio; F; CI; Br; I; -NO2; -CN; -CF3; -CH2CF3; -CHCI2; -CH2OH; - 
CH2CH2OH; -CH2NH2; -CH2SO2CH3; - or -GR^^ wherein G is -0-, -S-, -NR^-, - 

30 C(=0)-, .S(=0)-, -SO2-, -C(=0)0-, -C(=0)NR^-, -0C(=0)-, -NR^C(=0)-, - 
0C(=0)0-, -0C(==0)NR^2-, -NR^^C(=0)0-, -NR^C(=0)NR^-, -C(=S)-, - 
C(=S)S-, -SC(=S)-, .SC(=S)S-, -C(=NR^^)-, -C(=NR^2)0-, -C(=NR^)NR^^-, - 
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OC(=NR^)-, -NR^C(=NR*^>, -NR^'SOa-, -NR^^SO^NR^^-, or -SOzNR^-, 
wherein each occurrence of R^ and R^^ independently includes, but is not 
limited to, hydrogen, halogen, or an optionally substituted aliphatic, heteroaliphatic, 
alicyclic, heteroalicyclic, aromatic, heteroaromatic, aryl, heteroaryl, alkylaryl, or 
5 alkylheteroaryl moiety. Additional examples of generally applicable substituents are 
illustrated by the specific embodunents shown in the Examples that are described 
herein. 

[0021] The term "heteroalicyclic", "heterocycloalkyl" or "heterocyclic", as 

used herein, refers to compounds which combine the properties of heteroaliphatic 

10 and cyclic compounds and include but are not limited to saturated and unsaturated 
mono- or polycyclic heterocycles such as morpholino, pyrrolidinyl, furanyl, 
thiofiiranyl, pyrrolyl etc., which are optiomally substituted with one or more 
functional groups, as defined herem. In certain embodiments, the term 
"heterocyclic" refers to a non-aromatic 5-, 6- or 7- membered ring or a polycyclic 

15 group, including, but not limited to a bi- or tri-cyclic group comprising fused six- 
membered rings having between one and three heteroatoms independently selected 
from oxygen, sulfur and nitrogen, wherein (i) each 5 -membered ring has 0 to 2 
double bonds and each 6-membered ring has 0 to 2 double bonds, (ii) the nitrogen 
and sulfur heteroatoms may optionally be oxidized, (iii) the nitrogen heteroatom 

20 may optionally be quatemized, and (iv) any of the above heterocyclic rings may be 
fused to an aryl or heteroaryl ring. Representative heterocycles include, but are not 
limited to, pyrrolidinyl, pyrazolinyl, pyrazolidinyl, imidazolinyl, imidazolidmyl, 
piperidinyl, piperazmyl, oxazolidinyl, isoxazolidinyl, morpholinyl, thiazolidinyl, 
isothiazolidinyl, and tetrahydrofuryl. In certain embodiments, a "substituted 

25 heterocycloalkyl or heterocycle" group is utilized and as used herein, refers to a 
heterocycloalkyl or heterocycle group, as defined above, substituted by the 
independent replacement of one, two or three of the hydrogen atoms thereon with 
but are not limited to aliphatic; heteroaliphatic; alicyclic; heteroalicyclic; aromatic, 
heteroaromatic; aryl; heteroaryl; alkylaryl; alkylheteroaryl; alkoxy; aryloxy; 

30 heteroalkoxy; heteroaryloxy; alkylthio; arylthio; heteroalkylthio; heteroarylthio; F; 
CI; Br; I; -NO2; -CN; -CF3; -CH2CF3; -CHCI2; -CH2OH; -CH2CH2OH; -CH2NH2; - 
CH2SO2CH3; - or -GR^^ wherem G is -S-, -NR^^-, -C(=0)-, -S(=0)-, -SO2-, - 
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C(=0)0-, -C(=0)NR^-, -0C(=0)-, -NR'^CC^O)-, -0C(=0)0-, -OC(=0)NR^^-, - 
NR^C(=0)0^, .NR^C(=0)NR^-, -C(=S)-, -C(-S)S-, -SC(-S)-, -SC(-S)S-, - 
C(=NR^H .C(=NR°^)0-, -C(=NR^)NR^-, -OC(=NR°^)-, -NR°^C(-NR^^)-, - 
NR*^^S02-, -NR^^SOzNR^^-, or .S02NR^-, wherein each occurrence of R^^ 

5 and R^^ independently mcludes, but is not limited to, hydrogen, halogen, or an 
optionally substituted aliphatic, heteroaliphatic, alicyclic, heteroalicyclic, aromatic, 
heteroaromatic, aryl, heteroaryl, alkylaryl, or alkylheteroaryl moiety. Additional 
examples or generally applicable substituents are illustrated by the specific 
embodiments shown in'the Examples, which are described herein. 

10 [0022] Additionally, it will be appreciated that any of the alicyclic or 

heteroalicyclic moieties described above and herein may comprise an aryl or 
heteroaryl moiety fused thereto. Additional examples of generally applicable 
substituents are illustrated by the specific embodiments shown in the Examples that 
are described herein. 

15 [0023] In general, the term "aromatic moiety", as used herein, refers to 

stable substituted or unsubstituted unsaturated mono- or polycycUc hydrocarbon 
moieties having preferably 3-14 carbon atoms, comprising at least one ring 
satisfying the Huckel rule for aromaticity. Examples of aromatic moieties include, 
but are not limited to, phenyl, indanyl, indenyl, naphthyl, phenanthryl and anthracyl. 

20 [0024] In general, the temi "heteroaromatic moiety", as used herein, refers to 

stable substituted or unsubstituted unsaturated mono-heterocyclic or 
polyheterocyclic moieties having preferably 3-14 carbon atoms, comprising at least 
one ring satisfying the Huckel rule for aromaticity. Examples of heteroaromatic 
moieties include, but are not limited to, pyridyl, quinolinyl, dihydroquinolinyl, 

25 isoquinolinyl, quinazolinyl, dihydroquinazolyl, and tetrahydroquinazolyl. 

[0025] It will also be appreciated that aromatic and heteroaromatic moieties, 

as defined herein, may be attached via an aliphatic (e.g., alkyl) or heteroaliphatic 
(e.g., heteroalkyl) moiety and thus also include moieties such as - 
(aliphatic)aromatic, -(heteroaliphatic)aromatic, -(aliphatic)heteroaromatic, - 

30 (heteroaliphatic)heteroaromatic, -(alkyl)aromatic, -(heteroallcyl)aromatic, 

(alkyl)heteroaromatic, and -(heteroalkyl)heteroaromatic moieties. Thus, as used 
herein, the phrases "aromatic or heteroaromatic moieties" and "aromatic, 
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heteroaromatic, -(alkyl)aromatic, -(heteroalkyl)aromatic, 

(heteroallcyl)heteroaromatic, and -<heteroalkyl)heteroaromatic" are interchangeable. 
Substituents include, but are not limited to, any of the previously mentioned 
substituents, /,e., the substituents recited for aliphatic moieties, or for other moieties 

5 as disclosed herein, resulting in the formation of a stable compound. 

[0026] In general, the term "aryl" refers to aromatic moieties, as described 

above, excluding those attached via an aliphatic (e.g., alkyl) or heteroaliphatic (e.g., 
heteroalkyl) moiety. In certain embodiments of the present invention, "aryl" refers to 
a mono- or bicyclic carbocyclic ring system having one or two rings satisfying the 

10 Huckel rule for aromaticity, including, but not limited to, phenyl, naphthyl, 
tetrahydronaphthyl, indanyl, indenyl and the like. 

[0027] Similarly, the temi "heteroaryl" refers to heteroaromatic moieties, as 

described above, excluding those attached via an aliphatic (e.g., alkyl) or 
heteroaliphatic (e.g., heteroalkyl) moiety. In certain embodiments of the present 

15 invention, the temi "heteroaryl", as used herein, refers to a cyclic xmsaturated radical 
having from about five to about ten ring atoms of which one ring atom is selected 
from S, O and N; zero, one or two ring atoms are additional heteroatoms 
independently selected from S, O and N; and the remaining ring atoms are carbon, 
the radical being joined to the rest of the molecule via any of the ring atoms, such as, 

20 for example, pyridyl, pyrazinyl, pyrimidinyl, pyrrolyl, pyrazolyl, imidazolyl, 
thiazolyl, oxazolyl, isooxazolyl, thiadiazolyl, oxadiazolyl, thiophenyl, furanyl, 
quinolinyl, isoquinolinyl, and the like. 

[0028] Substituents for aryl and heteroaryl moieties include, but are not 

limited to, any of the previously mentioned substitutents, /.e., the substituents recited 

25 for aliphatic moieties, or for other moieties as disclosed herein, resulting in the 
formation of a stable compound. For example, aryl and heteroaryl groups (including 
bicyclic aryl groups) can be unsubstituted or substituted, wherein substitution 
includes replacement of one, two or three of the hydrogen atoms thereon 
independently with any one or more of the following moieties including, but not 

30 limited to: aliphatic; heteroaliphatic; alicyclic; heteroalicyclic; aromatic, 
heteroaromatic; aryl; heteroaryl; alkylaryl; alkylheteroaryl; alkoxy; aryloxy; 
heteroalkoxy; heteroaryloxy; alkylthio; arylthio; heteroalkylthio; heteroarylthio; F; 
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CI; Br; I; -NO2; -CN; -CF3; -CH2CF3; -CHCh; -CH2OH; -CH2CH2OH; -CH2NH2; - 
CH2SO2CH3; - or -GR°* wherein G is -NR^^-, -C(=0)-, -S(=0)-, -SO2-, - 

C(=0)0-, .C(=0)NR®-, -0C(=0)-, .NR^^C(=0)-, -0C(=0)0., -OC(=0)NR®., - 
NR^C(=0)0-, .NR°^C(=0)NR°^-, -C(=S)-, -C(=S)S-, -SC(=S)-, -SC(=S)S-, - 
5 C(=NR^)-, -C(=NR^)0-, -C(=NR^^)NR^3-, -OC(=NR^)-, -NR^C(=NR°3)-, - 
NR^S02-, -NR^2sQ^^C33_^ -S02NR^^-, wherein each occurrence of R^^ R^^ 
and R^^ independently includes, but is not limited to, hydrogen, halogen, or an 
optionally substituted aliphatic, heteroaliphatic, alicyclic, heteroalicyclic, aromatic, 
heteroaromatic, aryl, heteroaryl, alkylaryl, or alkylheteroaryl moiety. Additional 

10 examples of generally applicable substituents are illustrated by the specific 
embodiments shown in the Examples that are described herein. 
[0029] The terms "alkoxy" (or "alkyloxy"), and "thioalkyl" as used herein 

refers to an alkyl group, as previously defined, attached to the parent molecular 
moiety through an oxygen atom ("alkoxy") or through a sulfur atom ("thioalkyl"). 

15 In certain embodiments, the alkyl group contains about 1-20 aliphatic carbon atoms. 
In certain other embodiments, the alkyl group contains about 1-10 aliphatic carbon 
atoms. In yet other embodiments, the alkyl group contains about 1-8 aliphatic 
carbon atoms. In still other embodiments, the alkyl group contains about 1-6 
aliphatic carbon atoms. In yet other embodiments, the alkyl group contains about 1- 

20 4 aliphatic carbon atoms. Examples of alkoxy groups, include but are not limited to, 
methoxy, ethoxy, propoxy, isopropoxy, n-butoxy, tert-butoxy, neopentoxy and n- 
hexoxy. Examples of thioalkyl groups include, but are not limited to, methylthio, 
ethylthio, propylthio, isopropylthio, n-butylthio, and the like. 
[0030] The term "amine" refers to a group having the structure -N(R)2 

25 wherein each occurrence of R is independently hydrogen, or an aliphatic, 
heteroaliphatic, aromatic or heteroaromatic moiety, or the R groups, taken together, 
may form a heterocyclic moiety. 

[0031] The term "alkylamino" refers to a group having the structure - 

NHR'wherein R' is alkyl, as defined herein. The term "aminoalkyl" refers to a 
30 group having the structure NH2R'-, wherein R' is alkyl, as defined herein. In certain 
embodiments, the alkyl group contains about 1-20 aliphatic carbon atoms. In certain 
other embodiments, the alkyl group contains about 1-10 aliphatic carbon atoms. In 
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yet other embodiments, the alkyl, alkenyl, and alkynyl groups employed in the 
invention contain about 1-8 aliphatic carbon atoms. In still other embodiments, the 
alkyl group contains about 1-6 aliphatic carbon atoms. In yet other embodiments, 
the alkyl group contains about M aliphatic carbon atoms. Examples of alkylamino 
5 include, but are not limited to, methylamino, ethylamino, iso-propylamino and the 
like. 

[0032] The terms "halo" and "halogen" as used herein refer to an atom 

selected from fluorine, chlorine, bromine and iodine. 

[0033] The term "acyloxy", as used herein, does not substantially differ from 

10 the common meaning of this term in the art, and refers to a moiety of structure - 
OC(0)Rx, wherein Rx is a substituted or unsubstituted aliphatic, alicyclic, 
heteroaliphatic, heteroalicyclic, aryl or heteroaryl moiety. 

[0034] The term "acyl", as used herein, does not substantially differ from the 

common meaning of this term in the art, and refers to a moiety of structure - 
15 C(0)Rx, wherein Rx is a substituted or unsubstituted, aliphatic, alicyclic, 
heteroaliphatic, heteroalicyclic, aryl or heteroaryl moiety. 

[0035] The term "imine", as used herein, does not substantially differ from 

the common meaning of this term in the art, and refers to a moiety of structure - 
C(=NRx)Ry, wherein Rx is hydrogen or an optionally substituted aliphatic, alicyclic, 
20 heteroaliphatic, heteroalicyclic, aryl or heteroaryl moiety; and Ry is an optionally 
substituted aliphatic, alicyclic, heteroaliphatic, heteroalicyclic, aryl or .heteroaryl 
moiety. 

[0036] As used herem, the terms "aliphatic", "heteroaliphatiq", "alkyl", 

"alkenyl", "alkynyl", "heteroalkyl", "heteroalkenyl", "heteroalkynyl", and the like 

25 encompass substituted and unsubstituted, saturated and vmsaturated, and linear and 
branched groups. Similarly, the terms "alicyclic", "heteroalicyclic", 
"heterocycloalkyl", "heterocycle" and the like encompass substituted and 
unsubstituted, and saturated and unsaturated groups. Additionally, the terms 
"cycloalkyl", "cycloalkenyl", "cycloalkynyl", "heterocycloalkyl", 

30 "heterocycloalkenyl", "heterocycloalkynyl", "aryl", "heteroaryl" and the like 
encompass both substituted and unsubstituted groups. 
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[0037] It will be appreciated that additional examples of generally applicable 

substitutents are illustrated by the specific embodiments shown in the Examples 
which are described herein, but are not limited to these Examples. 
[0038] By the term "protecting group'*, has used herein, it is meant that a 

5 particular functional moiety, e,g,, 0, S, or N, is temporarily blocked so that a 
reaction can be carried out selectively at another reactive site in a multifunctional 
compound. In preferred embodiments, a protecting group reacts selectively in good 
yield to give a protected substrate that is stable to the projected reactions; the 
protecting group must be selectively removed in good yield by readily available, 
10 preferably nontoxic reagents that do not attack the other functional groups; the 
protecting group forms an easily separable derivative (more preferably without the 
generation of nev^ stereogenic centers); and the protecting group has a minimum of 
additional functionality to avoid further sites of reaction. As detailed herein, 
oxygen, sulfur, nitrogen and carbon protecting groups may be utilized. For example, 
15 in certain embodiments, as detailed herein, certain exemplary oxygen protecting 
groups are utilized. These oxygen protecting groups include, but are not limited to 
methyl ethers, substituted methyl ethers (e.g., MOM (methoxymethyl ether), MTM 
(methylthiomethyl ether), BOM (benzyloxymethyl ether), PMBM or MPM (p- 
methoxybenzyloxymethyl ether), to name a few), substituted ethyl ethers, 
20 substituted benzyl ethers, silyl ethers (e.g., TMS (trimethylsilyl ether), TBS 
(triethylsilylether), TIPS (triisopropylsilyl ether), TBDMS (t-butyldimethylsilyl 
ether), tribenzyl silyl ether, TBDPS (t-butyldiphenyl silyl ether), to name a few), 
esters (e.g., formate, acetate, benzoate (Bz), trifluoroacetate, dichloroacetate, to 
name a few), carbonates, cyclic acetals and ketals. • In certain other exemplary 
25 embodiments, nitrogen protecting groups are utilized. These nitrogen protecting 
groups include, but are not limited to, carbamates (including methyl, ethyl and 
substituted ethyl carbamates (e.g., Troc), to name a few) amides, cyclic imide 
derivatives, N-Alkyl and N-Aryl amines, imine derivatives, and enamine 
derivatives, to name a few. Certain other exemplary protecting groups are detailed 
30 herein, however, it will be appreciated that the present invention is not intended to 
be limited to these protecting groups; mther, a variety of additional equivalent 
protecting groups can be readily identified using the above criteria and utilized in 
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the present invention. Additionally, a variety of protecting groups are described in 
"Protective Groups in Organic Synthesis" Third Ed. Greene, T.W. and Wuts, P.G., 
Eds., John Wiley & Sons, New York: 1999, the entire contents of which are hereby 
incorporated by reference. 

5 [0039] The term "natural amino acid side chain" as used herein refers to the 

side chain of any one of the common, naturally occurring L-amino acids found in 
naturally occurrmg proteins: glycme (Gly), alanine (Ala), valine (Val), leucine 
(Leu), isoleucine (He), lysine (Lys), arginine (Arg), histidine (His), proline (Pro), 
serine (Ser), threonine (Thr), phenylalanine (Phe), tyrosine (Tyr), tryptophan (Trp), 

10 aspartic acid (Asp), glutamic acid (Glu), asparagine (Asn), glutamine (Gin), cysteine 
(Cys) and methionine (Met). 

[0040] The term "unnatural amino acid side chain" as used herein refers to 

the side chain of all amino acids which are not natural amino acids. This includes, 
for example, a-, p-, D-, L- amino acid residues, and compounds of the general 

15 formula o wherein the side chain R is other than the amino acid side 

chains occurring in nature. 

[0041] More generally, the term "amino acid side chain", as used herein, 

encompasses natural amino acid and unnatural amino acid side chains. 

[0042] As used herein, the term "pharmaceutically useful group or entity" 

20 refers to a compound or fragment thereof, or an organic moiety which, when 
covalently attached to a peptide or protein, can exert some biological or diagnostic 
function or activity when administered to a subject, or enhance the therapeutic, 
diagnostic or preventive properties of the parent peptide and/or protein in 
biomedical appUcations, or improve safety, alter biodegradation or excretion, or is 

25 detectable. Examples of suitable pharmaceutically useful groups or entities include 
hydrophilicity/hydrophobicity modifiers, pharmacokinetic modifiers, biologically 
active modifiers, detectable modifiers. A modifier can have one or more 
pharmaceutical functions, e.g,, biological activity and/or pharmacokinetics 
modification. Pharmacokmetics modifiers can include, for example, antibodies, 

30 antigens, receptor ligands, hydrophilic, hydrophobic or charged groups. Biologically 
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active modifiers include, for example, therapeutic drugs and prodrugs, antigens, 
immunomodulators. Detectable modifiers include diagnostic labels, such as 
radioactive, fluorescent, paramagnetic, superparamagnetic, ferromagnetic. X-ray 
modulating, X-ray-opaque, ultrosound-reflective, and otixer substances detectable by 

5 one of available clinical or laboratory methods, e.g., scintigraphy, NMR 
spectroscopy, MRI, X-ray tomography, sonotomography, photoimaging, 
radioimmunoassay. Modifiers can be small molecules or macromolecules, and can 
belong to any chemical or pharmaceutical class, e.g., nucleotides, chemotherapeutic 
agents, antibacterial agents, antiviral agents, immunomodulators, hormones or 

10 analogs thereof, enzymes, inhibitors, alkaloids and therapeutic radionuclides. Viral 
and non-viral gene vectors are considered to be a pharmaceutically useful entity or 
group. 

[0043] The term "biomolecules", as used hereia, refers to molecules (e.g., 

proteins, amino acids, peptides, polynucleotides, nucleotides, carbohydrates, sugars, 

15 lipids, nucleoproteins, glycoproteins, lipoproteins, steroids, etc.) which belong to 
classes of chemical compounds, whether naturally-occuxring or artificially created 
(e.g., by synthetic or recombinant methods), that are conmionly found in cells and 
tissues. Specific classes of biomolecules include, but axe not limited to, enzymes, 
receptors, neurotransmitters, hormones, cytokines, cell response modifiers such as 

20 growth factors and chemotactic factors, antibodies, vaccines, haptens, toxins, 
interferons, ribozymes, anti-sense agents, plasmids, DNA., and RNA. 
[0044] As used herein, the term "small molecule" refers to molecules, 

whether naturally-occxirring or artificially created (e.g., via chemical synthesis) that 
have a relatively low molecular weight. Preferred small molecules are biologically 

25 active in that they produce a local or systemic effect in animals, preferably 
mammals, more preferably humans. Typically, small irxolecxiles have a molecular 
weight of less than about 1500 g/mol. In certain prefenred embodiments, the small 
molecule is a drug. Preferably, though not necessarily, the drug is one that has 
already been deemed safe and effective for use by the appropriate governmental 

30 agency or body. For example, drugs for human use listed by the FDA under 21 
C.F.R. §§ 330.5, 331 through 361, and 440 through 460; drugs for veterinary use 
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listed by the FDA under 21 C.F.R. §§ 500 through 589, incorporated herein by 
reference, are all considered suitable for use with the present hydrophilic polymers. 
[0045] Classes of small molecule drugs that can be used in the practice of the 

present invention include, but are not limited to, vitamins, anti-AIDS substances, 

5 anti-cancer substances, antibiotics, immunosuppressants, anti-viral substances, 
enzyme inhibitors, neurotoxins, opioids, hypnotics, anti-histamines, lubricants, 
tranquilizers, anti-convulsants, muscle relaxants and anti-Parkinson substances, 
anti-spasmodics and muscle contractants including channel blockers, miotics and 
anti-cholinergics, anti-glaucoma compounds, anti-parasite and/or anti-protozoal 

10 compounds, modulators of cell-extracellular matrix interactions including cell 
growth inhibitors and anti-adhesion molecules, vasodilating agents, inhibitors of 
DNA, RNA or protein synthesis, anti-hypertensives, analgesics, anti-pyretics, 
steroidal and non-steroidal anti-inflammatory agents, anti-angiogenic factors, anti- 
secretory factors, anticoagulants and/or antithrombotic agents, local anesthetics, 

15 ophthalmics, prostaglandins, anti-depressants, anti-psychotic substances, 
anti-emetics, imaging agents , Many large molecules are also drugs. 
[0046] A more complete, although not exhaustive, listing of classes and 

specific drugs suitable for practicing the present invention may be found in 
"Pharmaceutical Substances: Syntheses, Patents, Applications" by Axel Kleemarm 

20 and Jurgen Engel, Thieme Medical Publishing, 1999 and the "Merck Index: An 
Encyclopedia of Chemicals, Drugs, and Biologicals", Edited by Susan Budavari et 
al, CRC Press, 1996, both of which are incorporated herein by reference, 
[0047] As used herein, the term "macromolecules" refers to molecxiles, 

whether naturally-occurring or artificially created (e.g., via chemical synthesis) that 

25 have a relatively high molecular weight, e.g., generally above 1500 g/mole. 
Preferred macromolecules are biologically active in that they exert a biological 
function in animals, preferably mammals, more preferably humans. Examples of 
macromolecules include proteins, enzymes, growth factors, cytokines, peptides, 
polypeptides, polylysine, proteins, lipids, polyelectrolytes, immimoglobulins, DNA, 

30 RNA, ribozymes, plasmids, and lectins. For the purpose of this invention, 
supramolecular constructs such as viruses and protein associates (e.g., dimers) are 
considered to be macromolecules. When covalently attached to a peptide or protein, 
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a macromolecule may be chemically modified prior to being covalently attached to 
said peptide or protein. 

[00481 As used herein, the term "diagnostic label'* refers to an atom, group 

of atoms, moiety or functional group, a nanocrystal, or other discrete element of a 

5 composition of matter, that can be detected in vivo or ex vivo usmg analytical 
methods known in the art. When covalently attached to a peptide or protein, such 
diagnostic labels permit the monitoring of the peptide or protein in vivo. On the 
other hand, constructs and compositions that include diagnostic labels can be used to 
monitor biological functions or structures. Examples of diagnostic labels include, 

10 without limitations, labels that can be used in medical diagnostic procedures, such 
as, radiopharmaceutical or radioactive isotopes for gamma scintigraphy and Positron 
Emission Tomography (PET), contrast agent for Magnetic Resonance Imaging 
(MRI) (for example paramagnetic atoms and superparamagnetic nanocrystals), 
contrast agent for computed tomography, contrast agent for X-ray imaging method, 

15 agent for ultrasound diagnostic method, agent for neutron activation, and moiety 
which can reflect, scatter or affect X-rays, ultrasounds, radiowaves and microwaves, 
fluorophores in various optical procedures, etc. 

Brief Description of the Drawing 
20 [0049] Figure 1 depicts examples of bifunctional glycopeptides {Le., 

diglycosylated peptides) containing 0-linkages preparaed by the method of the 
invention. 

[0050] Figure 2 depicts anomeric p-glycosylamines (22 and 23) that differ at 

a single stereocenter (asterisks), and a biftinctional glycopeptide (24) containing one 
25 of these structurally complex glycans. 

Detailed Description of certain preferred embodiments of the 

Invention 

[0051] In one aspect, the present invention provides novel methodologies for 

30 the synthesis of peptides and/or proteins functionalized at two or more non-adjacent 
amino acid sites (e.g., bearing a pharmaceutically useful group or entity covalently 
attached to at least two non-adjacent amino acid residues of the polypeptide/protein 
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chain). In certain embodiments, the invention provides a system for the synthesis of 
polypeptides and proteins bearing at least two glycans co*valently attached to non- 
adjacent amino acid residues of the polypeptide/protein chain. In certain 
embodiments, in the context of synthetic studies developed for the total synthesis of 

5 glycosylated fragments of erythropoietin, generalized methodologies were 
developed for the improved synthesis of polyglycosylated peptides and proteins. 
This general synthetic method encompasses the realization that native chemical 
ligation (NCL) is a glycan-compatible process that may be used for assembling large 
multiply glycosylated polypeptides from glycopeptide "building blocks. In yet 

10 another aspect, the present invention also provides the recognition that 
chemoselective reaction of a glycopeptide containing a C -terminal cysteine with a 
glycopeptide thioester may be achieved to from the corresponding diglycosylated 
peptide adduct. 

[0052] Specific examples, particularly with respect to the synthesis of 

15 bifunctional glycopeptides (i.e., diglycosylated peptides), are described in more 
detail below and in the Exemplification herein, along with certain general 
methodologies developed during the course of these syntheses. It will be 
appreciated by one of ordinary skill in the art that these examples are not intended to 
be limiting; rather all equivalents are intended to be incorporated into the scope of 
20 the present invention. In particular, the inventive method may be generally adapted 
to the preparation of polyftmctionalized peptides and proteins. 
[0053] 1) Description of certain embodiments of the inventive method 

[0054] In one aspect of the present invention, there is provided a method for 

preparing a polyftmctionalized peptide comprising a peptLdic backbone made up of 
25 four or more amino acids wherein two or more non-adjacent amino acids are 
independently substituted with a moiety having the structure: 


unsubstituted, linear or branched, cyclic or acyclic, saturated or unsaturated aliphatic 
30 or heteroaUphatic moiety; 



wherein each occurrence of L is independently a substituted or 
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each occurrence of A is independently an aliphatic, heteroaliphatic, 
aromatic, heteroaromatic, aryl, heteroaryl or a pharmaceutically useful group or 
entity; 

with the proviso that the peptide sequence between any two consecutive, 
5 non-adjacent, amino acids bearing a A-L^- moiety comprises at least one cysteine 
residue. 

In one aspect of the present invention, there is provided a method for 
preparing a polyfunctionalized peptide comprising a peptidic backbone made up of 
four or more amino acids wherein two or more non-adjacent amino acids are 
10 independently substituted with a moiety having the structure: 

with the proviso that the peptide sequence between any two consecutive, 
non-adjacent, amino acids bearing a A-L'- moiety comprises at least one cysteine 
residue; 

15 wherein tlie method comprises a step of: 

reacting a peptide acyl donor comprising a peptidic backbone made up of 
two or more amino acids wherein said peptide acyl donor has the structure: 


20 


A Peptide Ba ckbone | \^ oR^° 


Y 


O 

with a peptide amine acceptor having the structure: 



Peptide Backbone k rX2 


o 


Y 


imder suitable conditions to effect ligation; 

wherein kl and k2 are independently integers between 1 and about 20; 
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each occxirrence of A, Ai and A2 is independently an aliphatic, 
heteroaliphatic, aromatic, heteroaromatic, aryl, heteroaryl or a pharmaceutically 
useful group or entity; 

R^^ is a sulfide protecting group; 
5 is a group such that the moiety -C(=0)OR^° can be made to undergo 

ligation with the peptide amine acceptor; 

each occurrence of is independently a substituted or unsubstituted, linear 
or branched, cyclic or acyclic, saturated or unsaturated aliphatic or heteroaliphatic 
moiety; 

10 R^^ is hydrogen, alkyl, acyl, aromatic, heteroaromatic, aryl, heteroaryl, - 

alkyl(aryi), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid; and 

R^ is -OR^' or -NR^^R^°, wherein R^' is hydrogen, alkyl, aromatic, 
heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a carboxylic acid 

15 protecting group, an amino acid or a proctected amino acid; and R^^ and R'^^ are 
independently hydrogen, alkyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid. 

[0055] In certain embodiments, each occurrence of A is independently a 

20 pharmaceutically useful group or entity. In certain embodiments, each occurrence of 
A is independently a biomolecule, a small molecule, a macromolecule or a 
diagnostic label. 

[0056] In certain exemplary embodiments, each occurrence of A is 

independently a carbohydrate determinant having the structure: 


25 



wherem a, b, c, d, e, f, g, h, i, x, y and z are independently 0, 1, 2 or 3, with 
the proviso that the x, y and z bracketed structures represent furanose or pyranose 
moieties and the sum ofb and c is 1 or 2, the sum of d and f is 1 or 2, and the sum of 
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g and i is 1 or 2, and with the proviso that x, y and z are not simultaneously 0; 
wherein Ro is hydrogen, a linear or branched chain alkyl, acyl, arylalkyl or aryl 
group; wherein each occurrence of Ri, R2, R3, R4, Rs* R^ , R7, Rs and R9 is 
independently hydrogen, OH, OR*, NHR*, NHCOR', F, CH2OH, CH2OR*, a 
5 substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 
tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R' is independently hydrogen, CHO, COOR", or a substituted or 
unsubstituted linear or branched chain alkyl, acyl, arylalkyl or aryl group or a 
saccharide moiety having the structure: 



wherein Y and Z are independently NH or O; wherein k, 1, r, s, t, u, v and w 
are each mdependently 0, 1 or 2; with the proviso that the v and w bracketed 
structures represent furanose or pyranose moieties and the sum of 1 and k is 1 or 2, 
and the sum of s and u is 1 or 2, and with the proviso that v and w are not 

15 simultaneously 0; wherein R'o is hydrogen, a linear or branched chain alkyl, acyl, 
arylalkyl or aryl group; wherein each occurrence of Rio, Rii, R12, Ri3» Ri4 and R15 is 
mdependently hydrogen, OH, OR"', NHR'", NHCOR'", F, CH2OH, CH2OR"*, or a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 
tri)hydroxyaIkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherem 

20 each occurrence of Rie is hydrogen, COOH, COOR", CONHR", a substituted or 
unsubstituted linear or branched chain alkyl or aryl group; wherein each occvirrence 
of R'" is hydrogen, CHO, COOR''', or a substituted or unsubstituted linear or 
branched chain alkyl, acyl, arylalkyl or aryl group; and wherein each occurrence of 
R" and R*^ are each independently H, or a substituted or imsubstituted linear or 

25 branched chain alkyl, arylalkyl or aryl group. 

[0057] In certain embodiments, may comprise any functional moiety that 

is compatible with native chemical ligation reaction conditions. In certain 
embodiments, may comprise any functional moiety that is compatible with 
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aqueous conditions. In certain embodiments, a compatible functionality is one that 
is stable, unreactive and/or minimally interferes with the reaction. A thiol group is 
considered a compatible functionality, even though a thiol group may slow down the 
reaction. Examples of suitable functionalities include, but are not limited to, 
5 hydrocarbons, amines, amides, imines, hydroxyls, ethers, carboxylic esters, 
aldehydes, thiols, olefms, alkynes, aryls and heteroaryls. In certain exemplary 
embodiments, does not comprise a thiol group. 

[0058] In certain embodiments, each occurrence of is independently -O- 

(CH2)n-, wherein n is 0-9, or a glycoside-containing moiety (e.g mono- or poly- 
10 saccharide). 

[0059] In certain embodiments, each occurrence of is -~0-(CH2)n-CH2- 

and two or more non-adjacent amino acids is/are independently substituted with a 
moiety having the structure: 

15 wherein each occurrence of n is independently 0-8; and A is as defined 

above. In certain other embodiments, A is a carbohydrate moiety and each 
occurrence of the n-alkyl glycosidic moiety is independently either a- or p-linked to 
an amino acid residue of the backbone. It will be appreciated that 
polyfunctionalized peptides made according to the method of the present invention 

20 are not limited to those where each occurrence of comprises n-alkyl where n is 
greater than or equal to 1; rather each occurrence of A can be independently linked 
via the traditional direct linkage (n=0), via n-alkyl (such as pentyl), via a mono- 
saccharide moiety or any combination thereof In preferred embodiments, each 
occurrence of A is independently selected from the group consisting of Globo-H, 

25 fucosyl GMl, KH-1, glycophorin, STN, (2,3)ST, Le^, Le^ N3, Tn, 2,6-STn, Gb3 
andTF. 

[0060] In certain embodunents, occurrences of A may be the same or 

different. 

[0061] In certain other embodiments, certain occurrences of A are clustered, 

30 For example, in certain embodiments, the polyfunctionalized peptide comprises at 
least two adjacent fiinctionalized amino acids (/.e., clustered functionalized amino 
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acids), separated from another functionalized site by a peptide amino acid sequence 
comprising at least one cysteine residue* 

[0062] In certain other embodiments, certain occurrences of A are clustered 

glycosides. For example, in certain embodiments, the multiglycosylated peptide 
comprises at least two adjacent glycosylated amino acids (/.e., clustered glycosylated 
amino acids), separated from another glycosylated site by a peptide amino acid 
sequence comprising at least one cysteine residue. 

[0063] In certain embodiments, the present invention provides a method for 

preparing a polyfunctionalized peptide having the structure: 



si and s2 are independently an integer from 1 to about 20; 
tl, t2 and t3 are each independently an integer; 

R'^^ is hydrogen, alkyl, acyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryI), a nitrogen protecting group, an amino acid or a 
proctected amino acid; 

rX2 ig _0R^* or -NR^^^^ wherein R^* is hydrogen, alkyl, aromatic, 
heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a carboxylic acid 
protecting group, an amino acid or a proctected amino acid; and R^** and R^*^ are 
independently hydrogen, alkyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid; 

R^^ and R^^ are independently H, alkyl, heteroalkyi, aromatic, 
heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyI(heteroaryl), or a natural or non- 
natural amino acid side chain; 

each occurrence of is independently a substituted or unsubstituted 
aliphatic or heteroaliphatic moiety; 
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Ai and A2 are each independently an aliphatic, heteroaliphatic, aromatic, 
heteroaromatic, aryl, heteroaryl or a pharmaceutically useful group or entity; and 

at least one occurrence of the bracketed structure t2 is a cysteine residue or 
protected cysteine residue; 

wherein the method comprises a step of: 

reacting a peptide acyl donor having the structure: 



with a peptide amine acceptor having the structure: 



10 under suitable conditions to effect ligation; 

wherein the sum t+t' equals (t2)+l ; R^4s a sulfide protecting group; and R'^*^ 
is a group such that the moiety -C(=0)OR^° can be made to imdergo ligation with 
the glycopeptide amine acceptor. 

In certain exemplary embodiments, Ai and A2 are each independently a 
1 5 carbohydrate domain having the structure: 


R8 

R7 R9 


R4 Re 


wherein a, b, c, d, e, f, g, h, i, x, y and z are independently 0, 1, 2 or 3, with 
the proviso that the x, y and z bracketed structures represent furanose or pyranose 
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moieties and the sum of b and c is 1 or 2, the sum of d and f is 1 or 2, and the sum of 


wherein Ro is hydrogen, a linear or branched chain alkyl, acyl, arylalkyl or aryl 
group; wherein each occurrence of Ri, R2> Ra, R4. Rs, R6 , R?* a^^d R9 is 

5 independently hydrogen, OH, 0R^ NHR*, NHCOR*, F, CH2OH, CHaOR^ a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 
tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R* is independently hydrogen, CHO, COOR", or a substituted or 
unsubstituted linear or branched chain alkyl, acyl, arylalkyl or aryl group or a 

1 0 saccharide moiety having the structure: 


wherein Y and Z are independently NH or O; wherein k, 1, r, s, t, ui, v and w 
are each independently 0, 1 or 2; with the proviso that the v and w iDracketed 
structures represent fiiranose or pyranose moieties and the sum of 1 and k is 1 or 2, 

15 and the sum of s and u is 1 or 2, and with the proviso that v and w are not 
simultaneously 0; wherein R'o is hydrogen, a linear or branched chain alkyl, acyl, 
arylalkyl or aryl group; wherein each occurrence of Rio, Rii, R12, R13, Ri4 and R15 is 
independently hydrogen, OH, OR'", NHR"\ NHCOR'", F, CH2OH, CH2OR*", or a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 

20 tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R16 is hydrogen, COOH, COOR**', CONHR", a substituted or 
unsubstituted linear or branched chain alkyl or aryl group; wherein each occurrence 
of R'" is hydrogen, CHO, COOR'^ or a substituted or unsubstituted linear or 
branched chain alkyl, acyl, arylalkyl or aryl group; and wherein each occxirrence of 

25 R" and R'"" are each independently H, or a substituted or unsubstituted linear or 
branched chain alkyl, arylalkyl or aryl group; and wherein each glycosidic moiety is 
either a- or |3-linked to an amino acid. 


g and i is 1 or 2, and with the proviso that x, y and z are not simultaneously 0; 
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10064] In certain embodiments, the step may be carried out once, or repeated 

a desired number of times, to prepare a polyfunctionalized peptide having the 
structure: 



wherein R^^ and are as defined above; 

each occurrence of A may be the same or different and may be as defined for 
Ai and A2 above; 

each occxirrence of R^^ may be the same or different and may be as defined 
for R^^ and R^ above; 

q is an integer greater than or equal to 2; 

each occurrence of s is independently an integer from 1 to about 20; 
each occurrence of t is independently an integer; 
to is an integer; and 

each occurrence of R^° is independently H, alkyl, heteroalkyl, aromatic, 
heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), or a natural or non- 
natural amino acid side chain. 

[0065] In certain embodiments, q is an integer between 2 and about 5. In 

certain embodiments, q is an integer between 2 and about 10. In certain 
embodiments, q is an integer between 2 and about 15. In certain embodiments, q is 

) an integer between 2 and about 20. In certain embodiments, q is an integer between 
2 and about 25. In certain embodiments, q is an integer between 2 and about 30. In 
certain embodiments, q is an integer greater than 30. In certain embodiments, q is 2. 
[0066] In certain embodiments, the sum s+t is between about 2 and about 6. 

In certain embodiments, the sum s+t is between about 2 and about 10. In certain 

5 embodiments, the sum s+t is between about 2 and about 15. In certain 
embodiments, the sum s+t is between about 2 and about 20. In certain 
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embodiments, the sum s+t is between about 2 and about 50. In certain 
embodiments, the sum s+t is between about 2 and about 100. In certam 
embodiments, the sum s+t is between about 2 and about 150. In certain 
embodiments, the sum s+t is between about 2 and about 200. In certain 

5 embodiments, the sum s+t may be greater than 200. 

[0067] In certain embodiments, tO is an integer from 0 to about 2. In certain 

embodiments, tO is an integer from 0 to about 5. In certain embodiments, tO is an 
integer from 0 to about 10. In certain embodiments, tO is an integer from 0 to about 
15. In certain embodiments, tO is an integer from 0 to about 20. In certam 

10 embodiments, tO is an integer from 0 to about 25. In certain embodiments, tO is an 
integer from 0 to about 30. In certain embodiments, tO is an integer from 0 to about 
50. In certain embodiments, tO is an integer from 0 to about 100, In certain 
embodiments, tO is an integer from 0 to about 150. In certain embodiments, tO is an 
integer from 0 to about 200. In certain embodunents, tO is an integer greater than 

15 200. 

[0068] In certain other embodiments, R^^ is hydrogen, Fmoc or Ac. 

[0069] In certain other embodiments, is NH2. 

[0070] In certain other embodiments, R^° is a sulfur-substituted aryl moiety. 

In certain embodiments, R^° is a disulfide-substituted aryl moiety. In certain 
20 embodiments, R^° has the structure: 


-r-SSR 


wherein R is an aliphatic, heteroaliphatic, aromatic or heteroaromatic moiety. 
In certain exemplary embodiments, R^*^ has the structure: 




RSS 

25 [0071] In certain embodiments, R is lower alkyl. In certain exemplary 

embodiments, R is ethyl. 

[0072] In certam other embodiments, R^* is -S/Bu. 
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[0073] In certain exemplary embodiments, in the step of reacting the peptide 
acyl donor having the structure: 



with the peptide amine acceptor under suitable conditions to effect ligation, 
5 an intermediate having the following structure is formed in situ: 



XOa 


wherein R^^* is an oxygen-substituted aryl moiety. 
[0074] In certain embodiments, the suitable conditions to effect ligation 

comprise MES-Na. 

10 [0075] In certain exemplary embodiments, in the peptide acyl donor havmg 

the structure: 



the amino acyl residue directly attached to -OR^^ is phenylalanine. 
[0076] In certain embodiments, A may comprise any functional moiety that 

15 is compatible with native chemical ligation reaction conditions. In certain 
embodiments, A may comprise any functional moiety that is compatible with 
aqueous conditions. In certain embodiments, a compatible functionality is one that 
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is stable, unreactive and/or minimally interferes with the reaction. A thiol group is 
considered a compatible fimctionality, even though a thiol group may slow down the 
reaction. Examples of suitable functionalities include, but are not limited to, 
hydrocarbons, amines, amides, imines, hydroxyls, ethers, carboxylic esters, 
5 aldehydes, thiols, olefins, alkynes, aryls and heteroaryls. In certain exemplary 
embodiments, A does not comprise a thiol group. 

[0077] In certain embodiments, when at least one occurrence of A (or Ai 

and/or A2, as further defined for A) is a carbohydrate domain, some or all of 
carbohydrate domains are 0-Iinlced to the peptide backbone, hi certain other 
10 embodiments, when at least one occurrence of A (or Ai and/or A2, as further defined 
for A) is a carbohydrate domain, some or all of carbohydrate domains are N-linlced 
to the peptide backbone. In certain other embodiments, when at least one 
occurrence of A (or Ai and/or A2, as further defined for A) is a carbohydrate 
domain, the inventive method may be praticed while the carbohydrate domain is 
15 partially or fiilly deprotected (i.e., comprises exposed OH groups). In certain 
embodiments, the peptide sequence between each point of attachment of the A 
moieties comprises a cysteine residue. In certain embodiments, the 
polyfimctionalized construct (/.e., construct bearing more than one A moiety) is 
prepared by Native Chemical Ligation. hi certain embodiments, the 
20 polyfimctionalized peptides obtained by the inventive method are symmetrical 
(functionalized peptide building blocks to be ligated have the same peptide sequence 
(safe for N- and/or C-terminals) and bear the same functionalization pattern), 
nonsymmetrical (functionalized peptide building blocks to be ligated differ in 
peptide sequence (excluding N- and C-terminals) and/or functionalization pattern). 
25 In certain embodhnents, the inventive method allows the preparation of 
multiglycosylated peptides designed to approximate the spatial position(s) of 
carbohydrate(s) in glycoprotein/glycopeptides of interest (e.g., naturally occurring 
glycoproteins such as gpl20 and erythropoietin). 

[0078] In certain embodiments, the method fiirther comprises a step of 

30 conjugating the polyfunctionalized peptide to an immunogenic carrier. In certain 
exemplary embodiments, the carrier is a protem, a peptide or a lipid. In certain other 
exemplary embodiments, the carrier is Bovine Serum Albumin (BSA), Keyhole 
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Limpet Hemocyanin (KLH) or polylysine. In certain other embodiments, the carrier 
is a lipid carrier having the structure: 



wherein m, n and p are each independently integers between about 8 and 20; 
5 and Rv is hydrogen, substituted or unsubstituted linear or branched chain lower alkyl 
or substituted or unsubstituted phenyl. In certain exemplary embodiments, ra', n' 
and p' are each 14 and the lipid is tripalmitoyl-S-glycerylcysteinylserine (e.g., 
PamCys). 

[0079] It will be appreciated that the carrier can be linked to the 

10 polyfimctionalized peptide either directly or through a crosslinker, and thus the 
peptide may be attached to a protein, peptids, or lipid, as well as a (crosslinker- 
protein), (crosslinker-peptide) and (crosslinker-lipid) moiety. 
[0080] Crosslinkers suited to the invention are widely known in the art (see, 

for example, 1994 Pierce Technical Handbook: cross-linking (Appendix A), which 
15 is available at http://www.piercenet.com/ resources/browse,cfm?fldID=184), 
including bromoacetic NHS ester, 6-(iodoacetamido)caproic acid NHS ester, 
maleimidoacetic acid NHS ester, maleimidobenzoic acide NHS ester, etc. In certain 
preferred embodiments, the crosslinker is MMCCH (4-(maleimidomethyl) 
cyclohexane-l-carboxyl hydrazide). In certain other preferred embodiments, the 
20 crosslinker is MBS (m-maleimidobenzoyl acid N-Hydroxysuccinimidyl ester). In 
certain embodiments, the crosslinker is a fragment having the structure: 
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whereby said structure is generated upon conjugation of a maleimidobenzoic 
acid N-hydroxy succinimide ester with a suitable functionality on the 
polyfunctionalized peptide. 

[0081] 2) Exemplary Synthetic Methodology 

5 [0082] Native Chemical Ligation 

[0083] One of the more widely used methods for the synthesis of 

glycopeptides is native chemical ligation (NCL).^^ First reported by Kent in 1994, 
NCL allows for the assembly of large proteins with native amide bonds from 
unprotected peptide building blocks (Scheme 1), Furthermore, the reaction is mild, 

10 selective, and compatible with the presence of glycans. When glycans are present in 
the reaction, they are typically found on the C-terminal side. In the event, a 
glycopeptide containing a C-terminal cysteine undergoes a chemoselective reaction 
with a peptide thioester. The resulting peptide thioester then rearranges 
spontaneously to furnish a native peptide bond, effectively lengthening the peptide 

1 5 backbone of the glycopeptide. 

[0084] Scheme 1. Exemplary Native Chemical Ligation methodology 

o tps?f:s 

reaction O j 1 i 



spontaneous 
rearrangement 


o r 

" O 


^SH Y 


[0085] As discussed above, glycopeptides bearing multiple glycosidic 

domains are of particular interest. Such structures include the synthesis of gpl20 

20 fragments, erythropoietin (EPO), human chorionic gonadotropin (HCG), and 
amyloid precursor protein (APP). Drawing from previous studies in the area,"*^ NCL 
was explored as a way to couple two small glycopeptides to form one large 
glycopeptide. One drawback that became apparent lies in the difficulty with which 
unprotected glycopeptide a-thioesters are synthesized.^^ Accordingly, a new 

25 method for the synthesis of glycopeptide a-thioesters, or their equivalent, was 
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devised, to allow the assembly of glycopeptides bearing multiple glycosidic 
domains. 

[0086] A closer look at the reaction mechanism of NCL provided clues as to 

how this problem might be approached. In one embodiment of the reaction itself, 
the sulfhydryl group of the cysteine coupling partner, initially protected as a 
butyldisulfide, is released by disulfide exchange with 2-mercaptoethanesulfonic acid 
(MESNa) and then unilergoes trans-thioesterification as mentioned above. Without 
wishing to be bound to any particular theory, it was thought that it might be possible 
to mask the a-thioester as a phenyl ester, bearing an ethyldisulfide in the ortho 
position (Scheme 2) by taking advantage of this disulfide exchange. Under the 
reaction conditions, the free sulfhydryl would be released and undergo a 
spontaneous rearrangement to furnish a thioester capable of entering into the NCL 
pathway. 


[0087] Scheme 2, A new method for the generation of peptide a-thioesters. 

NCL: 



disulfide 


[0088] To test the viability of this proposed modification, the simple 

dipeptide, PheCys, was synthesized. The desired arrAo-substituted phenol was 
synthesized in two steps from commercially available 2-mercaptophenol by 
oxidation and disulfide exchange. Coupling of the phenol to phenylalamne followed 
by the removal of the Boc protecting group set the stage for our first attempt 
(Scheme 3). The reaction proceeded in good yield. More importantly no 
racemization was observed under the reaction conditions. 
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10 


[0089] Scheme 3. Exemplary NCL with masked thioester 


1) Boc-L-Phe-OH, 


a OH 1) I2, MeOH, H2O i^^''^^^" EPCI> DMAP 



o 


OH 


HS^ 


MESNa 

SSEt pH7.5 



78% yield 
No observed 
racemlzatlon 

[0090] The synthesis of glycopeptides bearing two giycosidic domains was 

then undertaken. Synthesis of the //-terminal fragment (the fragment that would 
ultimately become the masked thioester) began with the soUd phase synthesis of a 
peptide acid, followed by standard peptide coupling to introduce the phenylalanine 
bearing the phenyl ester. TFA deprotection of the side chain protecting groups 
ftimished the peptide backbone for the iV^terminal fragment (Scheme 4). 
[0091] Scheme 4. Exemplary synthesis of the peptide backbone of the N- 

terminal fragment 


^MH NH 
Pbr Pbf 



15 


) 
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[0092] Coupling of an oligosaccharide with excess peptide furnished a fully 

deprotected glycopeptide ready for ligation (Scheme 5). 

[0093] Scheme 5. Exemplary approach to the completion of the AT-tenninal 

fragment 


C T 

NH ^NH 


8Et 


-NH2 


HATU 

DJEA 60-70% 
DMSO 


°( 

NH ^NH 
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[0094] With the iV-terminai portion in hand, the synthesis of the C-terminal 

coupling partner was then carried out. In a similar way, the peptide backbone was 
10 prepared by solid phase peptide synthesis. Upon cleavage from the resin, 
glycosylation and Fmoc removal, the C-terminal glycopeptide was obtained 
(Scheme 6). 

[0095] Scheme 6. Exemplary synthesis of the C-terminal glycopeptide 
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HjN^ peptide synthesis j 8 J T [) X ^ 

OH 

HATU 
OIEA 
DMSO 

[0096] In certain exemplary embodiments, the synthesis of glycopeptides 

bearing two glycosydic domains was accomplished as follows: equal molar amounts 
of both the iV-terminal and C-terminal glycopeptides were combined in a LCMS vial 

5 and to this was added a solution of MESNa in phosphate buffered saline. The 
reaction was monitored by LCMS and to our amazement, the rearrangement of the 
phenyl ester to thioester occurred quite rapidly,^ ^ additionally mass peaks 
corresponding to the desired product were observed almost instantly. Upon 
completion of the reaction, any remaining disulfide bonds were reduced by the 

10 action of tris(2-carboxyethyl) phosphine hydrochloride (TCEP). The crude reaction 
mixture was then subjected to HPLC for purification. The reaction proceeded 
extremely well, providmg doubly glycosylated glycopeptides, typically in 60-70% 
yield. Exemplary di-glycosylated peptides that may be prepared by the method of 
the present invention are detailed in the Exemplification below. It should be noted 

15 that symmetrical, nonsymmetrical and mixed (iV-linked and 0-Iinked) glycopeptides 
can be used. 

[0097] Accordingly, there is provided herein a method for preparing 

glycopeptides comprising at least two carbohydate domains covalently attached 
thereto. In certain embodiments, some or all of carbohydrate domains are 0-linked. 
20 In certain other embodiments, some or all of carbohydrate domains are N-linked. In 
certain embodiments, the glycopeptide comprises two or more carbohydate domains 
covalently attached thereto, wherein the glycopeptide sequence between each point 
of attachment of the carbohydrate domains comprises a cysteine residue. In certain 
embodiments, the mutli-glycan glycopeptide is prepared by Native Chemical 


then 
20% plperdlne 
In DMF 
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Ligation. In certain embodiments, the method allows for coupling where each 
coupling partner is a glycopeptide itself. Symmetrical, nonsymmetrical and mixed 
(N-linked and 0-lmked) glycopeptides can be obtained. In certain embodiments, the 
method involves the in situ generation of a thioester that is then used unmediately in 
5 native chemical ligation. 

[0098] Peptide Thioester Synthesis 

[0099] Several methods have been developed for peptide thioester synthesis, 

including the original "Boc chemistry" (Boc = rerNbutyloxycarbonyl) method (See, 
for example, (1) Canne, L. E.; Walker, S. M.; Kent, S. B. H. "A General Method for 
10 the Synthesis of Thioester Resin Linkers for Use in the Solid-Phase Synthesis of 
Peptide Alpha-Thioacids." Tetrahedron Lett. 1995, 36, 1217-1220; and (2) Hojo, H.; 
Aimoto, S, "Polypeptide Synthesis Using the S-Alkyl Thioester of a Partially 
Protected Peptide Segment-Synthesis of the DNA- Binding Domain of C-Myb 
Protein (142-193)-NH2." Bull Chem, Sac, Jpn. 1991, 64, 111-117) and several 
15 Fmoc-compatible systems (See, for example, (1) Shin, Y.; Winans, K. A.; Backes, 
B. J.; Kent, S. B. H.; Elhnan, J. A.; Bertozzi, C. R. 'Tmoc-based synthesis of 
peptide-(alpha)thioesters: Application to the total chemical synthesis of a 
glycoprotein by native chemical ligation." 1 Am, Chem, Soc, 1999, 121, 11684- 
11689; (2) Ingenito, R.; Bianchi, E.; Fattori, D.; Pessi, A. "Solid phase synthesis of 
20 peptide C-terminal thioesters by Fmoc/t-Bu chemistry." J. Am. Chem, Soc, 1999, 
12U 11369-11374; (3) Li, X. Q.; Kawakami, T.; Aunoto, S. "Direct preparation of 
peptide thioesters using an Fmoc solidphase method." Tetrahedron Lett. 1998, 39, 
8669-8672; (4) Clippmgdale, A. B.; Barrow, C. J.; Wade, J. D. "Peptide thioester 
preparation by Fmoc solid phase peptide synthesis for use in native chemical 
25 ligation." J. Pept, Sci 2000, 6, 225-234; and (5) Bu, X. Z.; Xie, G. Y.; Law, C. W.; 
Guo, Z. H. "An improved deblocking agent for direct Fmoc solidphase synthesis of 
peptide thioesters." Tetrahedron Lett. 2002, 43, 2419-2422). In ceratin 
embodiments, the model thioester is a C-terminal glycine thioester, which is locally 
achiral and caimot be epimerized, and is therefore easy to synthesize. Though the 
30 desired GP120 thioester contains an epimerization-prone C-terminal histidine (His) 
residue, such tliioesters have been synthesized previously and have in fact been 
shown to modulate favorably the rate of NCL (See, for example, Hackeng, T. M.; 
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GrifFin, J, H.; Dawson, P. E. "Protein synthesis by native chemical ligation: 
Expanded scope by using straightforward methodology." Proc, Natl Acad. Set a S. 
A, 1999, 96, 10068-10073). 

[001001 It is understood that, although the discussion above regarding Native 
5 Chemical Ligation focuses on the preparation of multiglycosylated peptides, the 
method may readily be adapted to other polyfimctionalized peptides and/or proteins. 
For example, peptides and/or proteins functionalized at more than one amino acid 
site with a pharmaceutically useful group or entity may be prepared by the method 
of the invention. 

Equivalents 

[00101] The representative examples which follow are intended to help 
illustrate the invention, and are not intended to, nor should they be construed to, 
limit the scope of the invention. Indeed, various modifications of the invention and 

15 many further embodiments thereof, in addition to those shown and described herein, 
will become apparent to those skilled in the art from the full contents of this 
document, including the examples which follow and the references to the scientific 
and patent literature cited herein. In but one illustrative example, protecting groups 
play an important role in the syntliesis of the carbohydrate domains and synthetic 

20 conjugates, as described herein; however it will be appreciated by one of ordinary 
skill in the art that the present invention encompasses the use of various alternate 
protecting groups known in the art. Those protecting groups used in the disclosure 
including the Examples below are merely illustrative. 

[00102] It should further be appreciated that, uless otherwise indicated, the 
25 contents of those cited references are incorporated herein by reference to help 
illustrate the state of the art. The following examples contain important additional 
information, exemplification and guidance which can be adapted to the practice of 
this invention in its various embodiments and the equivalents thereof. 


30 Exemplification 

[00103] The method of this invention can be understood further by the 
examples that illustrate some of the processes by which the inventive method may 
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be practice. It will be appreciated, however, that these examples do not limit the 
invention. Variations of the invention, now known or further developed, are 
considered to fall within the scope of the present invention as described herein and 
as hereinafter claimed. 

5 [00104] 1) General Description of Synthetic Methods: 

[00105] The practitioner has a a well-established literature of peptide, protein 
and glycoside chemistry to draw upon, in combination with the mformation 
contained herein, for guidance on synthetic strategies, protecting groups, and other 
materials and methods useful for practicing the method of this mvention. 

10 [00106] The various references cited herein provide helpful backgroimd 
information on preparing complex glycosides, glycosylated peptides and other 
glycosylated constructs that may be applied and/or adapted to the method of the 
present invention. 

[00107] According to the present invention, any available techniques can be 
15 used to practice the method of the invention. For example, a variety of solution 
phase synthetic methods such as those discussed in detail below may be used. 
Alternatively or additionally, a variety combinatorial techniques, parallel synthesis 
and/or solid phase synthetic methods known in the art may be used. 
[0100] Starting materials and reagents used in practicing the specific 

20 embodiments of the invention detailed below and herein are either available from 
commercial suppliers such as Aldrich Chemical Company (Milwaukee, WI), 
Bachem (Torrance, CA), Sigma (St. Louis, MO), or are prepared by methods well 
known to a person of ordinary skill in the art following procedures described in such 
references as Fieser and Fieser 1991, "Reagents for Organic Synthesis", vols 1-17, 
25 John Wiley and Sons, New York, NY, 1991; Rodd 1989 "Chemistry of Carbon 
Compounds", vols. 1-5 and supps, Elsevier Science Publishers, 1989; "Organic 
Reactions", vols 1-40, John Wiley and Sons, New York, NY, 1991; March 2001, 
"Advanced Organic Chemistry^*, 5th ed. John Wiley and Sons, New York, NY; and 
Larock 1990, "Comprehensive Organic Transformations: A Guide to Functional 
30 Group Preparations", 2"'' ed. VCH Pubhshers. These schemes are merely illustrative 
of some embodiments of the present mvention, and various modifications to these 
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schemes can be made and will be suggested to a person of ordinary skill in the art 
having regard to this disclosure. 

[0101] In practicing the invention, starting materials, intermediates, and 

compounds may be isolated and purified using conventional techniques, including 
5 filtration, distillation, crystallization, chromatography, and the like. They may be 
characterized using conventional methods, including physical constants and spectral 
data. 

[0102] Methods for preparing glycopeptides {e.g., O- or N-linked 

glycopeptides) and for conjugating peptides and glycopeptides to carriers are known 

10 in the art. For example, guidance may be found in U.S. Patent No.: 6,660,714; U.S. 
Patent Application Nos.: 09/641,742; 10/209,618; and 10/728,041; U.S. Provisional 
Patent Application Nos.: 60/500,161 and 60/500,708; and International Patent 
Application Nos.: PCT/ US03/38453 and PCT/ US03/38471; each of the above- 
referenced patent documents are hereby incorporated by reference herein. 

15 [0103] Overview 

[0104] Following extensive earlier studies, wherein our chemistry was 

interfaced with important fmdings from other laboratories,^^ we demonstrated the 
ability to ligate fully synthetic iV-linked glycopeptides containing various complex 
oligosaccharides to larger polypeptide domains. For example, major glycosylated 

20 subsections of prostate specific antigen (PSA)^' and SpUO^"^ could be obtained. 
Provided herein is an exemplary convergent approach to access polypeptides 
multiply glycosylated at designated sites. As discussed previously, glycoproteins 

2 • * 3 

have potentially important clinical roles in the contexts of vaccines, diagnostics, 
and therapeutics.'* The method of the present invention thus has important 
25 applications m the field of medicme since it offers access to complex therapeutically 
useful glycoproteins. 

[0105] Scheme 7 schematizes an embodiment of the present invention. For 

example, glycopeptides 4 and 7 may be generated in a convergent way from their 
respective glycan and peptide precursors, and may be coupled with full convergence 
30 to reach 8. 

[0106] Scheme 7. A convergent synthesis of bifunctional glycopeptides. 
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(lalycan = complex oligosaccharide ) 
[0107] In one embodiment, the synthesis of a suitable glycopeptide acyl 

donor for Ugation with 7 was investigated. One approach was based on utilizing 
buUding block 4, in the form of a thioester, as acyl donor. However, the preparation 

5 of a thioester such as 4 by convergent means'^ appeared to present difficulties. 

[0108] It was in pondering this problem that an attractive possibility 

presented itself (Scheme 8). The synthesis of a phenolic ester equipped with an 
unsymmetrical disulfide (cf. 10) was then investigated. The phenolic ester linkage 
would in itself probably not manifest sufficient reactivity to serve as a viable 

10 acylating agent. However, equipped as it would be with an ortfto-disulfide moiety, 
the phenolic ester could operate as an incipient acyl donor. Thus, reduction of the 
disulfide in 12, as shown, might well set the stage for in situ elaboration into a 
tiiioester {cf. 13->-14) by 0->S acyl migration. Thioester 14, present m an 
unfevorable, but dynamic equilibrium, could perhaps be interdicted by the 

15 machinery appropriate to native chemical ligation (NCL),'^'' ultimately culminating 
in 8. 

[0109] While the proposal for the intermediacy of a discreet thioester 14 is 

certainly attractive, it is not necessarily an obligatory event en route to ligation. 
Thus, as suggested in structure 13, the presence of a firee orf/zo-benzene thiol 
20 ftmction might well enhance the acylating facility of the phenolic ester," perhaps 
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through in situ hydrogen bonding or anchimeric assistance. In either case, the 
sequence is set into motion only upon reductive cleavage of the disulfide. 
[0110] The synthesis of the masked glycopeptide acyl donor was 

accomplished as follows (Scheme 8). Commercially available 2-mercaptophenol (9) 
was oxidized to its symmetrical disulfide. Subsequent exchange with excess ethyl 
disulfide^^ furnished an unsymmetrical, aryl-alkyl disulfide. Acylation of the 
phenolic hydroxyl group with Boc-phenylalanine followed by acidic cleavage of the 
Boc protecting group afforded phenylalanine derivative 10. Standard peptide 
coupling of 10 with a fully protected peptide acid^^ followed by global deprotection 
provided 11,^° which contains a fi-ee aspartate residue. Aspartylation of a 
chitobiose-derived glycosylamine with 11 afforded the virtual acyl donor 12, 
corresponding to our target structure 4 (Scheme 7). 

[0111] Scheme 8. A structure, synthesis, and proposed mechanism of a 

masked glycopeptide acyl donor.^ 


HS. 


HO' 


a-d 


EtSS, 


OH 


[^Glycan^NH 

........ 



Scheme 1 


pH-'7.4, 

excess 
R-SH 

disuifide 
exchange 

intermoiecular 
acyl transfer 


CGIycan]i-NH 


0 EtSS. 
Ph^' 11 


EtSS 
PUT' 12 


(R-SH) 


H]N 


SR 


Ph^15 


5 


(^Glycan^ = chitobiose in 12-15; CPeptld^ = FmocRNRSG- In 12-15; R = CH2CH2S03Na 
[0112] " Reagents and conditions: (a) h, MeOH, H2O; (b) BF3-OEt2, 

EtSSEt, CH2CI2, 99%, 2 steps; (c) Boc-Phe-OH, EDCI, DMAP, CH2CI2/THF, 93%; 
(d) 4N HCiydioxane, 94%; (e) Fmoc-Arg(Pbf)-Asp('Bu)-Arg(Pbf)-Ser('Bu).Gly-OH, 
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HATU. DIEA, DMF, 61%; (f) TFA:phenol:Et3SiH:H20 35:2:1:1, 60%; (g) 
GlcNAcpl-^GlcNAcp-NH2, HATU, DIEA, DMSO, 52%. 
[01131 As a preliminary probe of the practicability of this scheme, 

phenylalanine derivative 10 was subjected to the neutral, aqueous, reducing reaction 
5 conditions described above, along with free cysteine (Scheme 9). The desired free 
dipeptide Phe-Cys (17) was obtained in 78% isolated yield. Only a single product 
was observed by LCMS and 'H NMR, strongly suggesting that epimerization at the 
a-carbon of phenylalanine did not occur. 

[0114] Scheme 9. An example of the latent acyl donor function. 

Etss^^ ^SH excess ^ ^SH 

i 8 PH~7.4 " o 

^ 10 16 78% yield 17 

[0115] The synthesis of bifonctional glycopeptides was then carried out. 

Thus, 12 was treated with excess sodium 2-mercaptoethanesulfonate (MES-Na) at 

neutral pH in aqueous phosphate buffered saline (PBS) in the presence of 18, 

containing a latent N-terminal cysteine residue (Scheme 10). As observed by LCMS 

15 during the reaction, 12 was almost instantaneously (less than 3 min) converted to the 
thioester derived from MES-Na (c/ 15); very little hydrolysis to the free carboxylic 
acid was noted. Over the course of the next several hours, we observed a gradual 
decrease in the amount of this thioester, with a concomitant increase in the desired 
product 19. Only one compound of the appropriate mass was observed by LCMS 

20 and NMR, again indicating a lack of epimerization. 

[0116] Scheme 10. Exemplary convergent coupling of functionalized 
glycopeptides. 
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Fmoc-Arg-Asn-Arg-Ser-Gly "N^^, 



Ala-Ser-NH2 



excess 
71% MES'Na, 
pH -7.4 


[01171 


Fmoc-Arg-Asn-Arg-Ser-Gly-N'Y V^AIa-Asn-Ala-Ser-NHz 

19 Phe - Cys 

With this proof of principle in hand, attention was directed to 


investigating the generality of the method. For example, applicability of the method 
to incorporation of 0-linked domains^^ was established, as shown through the 

5 synthesis of 20. Notably, the histidine-containing system 21 was also assembled 
(for structure of 21 and 20, see Figure 1). Since activated C-terminal histidine 
residues are known to be particularly susceptible to epimerization,^^ a prophylactic 
dinitrophenyl (DNP) protecting group^ was used for the imidazole x-nitrogen 
during the synthesis. As expected, the DNP group was cleaved concurrently during 

10 the mildly nucleophilic coupling reaction. No indications of epimerization were 
observed by LCMS or NMR. 

[0118] Validation of the highly convergent methodology in the context of 

• 13 25 

complex glycans was then investigated. Syntheses of glycosylamines 22 and 23 
(Figure 2) have been reported previously. It will be noted that these compounds 

15 differ in a single stereogenic locus in the interior C-ring (see asterisks). Thus 22 
corresponds to a fully stereochemically competent iV^linked high mannose 
oligosaccharide. In contrast, its C-ring C-2 epimer is an unnatural stereochemical 
mutant. Studies aimed at evaluating whether high-mannose binding lectins can 
discriminate between point mutants that differ in only one out of a total of 25 

20 stereochemical loci are underway. 

[0119] Preparation of pentasaccharide-derived glycopeptide coupling 

partners proceeded via Lansbury aspartylation^^^ of 22 and 23 with 25 and 26, which 
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afforded 27 and 28, respectively. Coupling of 28 with a relatively simple 
chitobiose-containing acyl donor gave 24 (Figure 2) in 77% yield. 
[0120] Scheme 11. Convergent coupling of two functionalized 

glycopeptides containing distinct, structurally complex glycans.' 


0^ 


22 


0 1 Ji^ 
Ac-Arg-Asp-Arg-Ser-Gly-PhB-O"^^ 


25 


a,b 


OH 


AcHN 
23 


Finoe^s-Ala^p^la-Ser-NH2 
•BuSS 26 


NH 

■o 


EtS4:S 


Ac-Arg-Asn-Arg-Ser-Gly-N.^^Q-' 

I' 


<BuS-lrS 


H2N-Cjrs-Ala-Asn-Ala-S0r-NH2 


28 


q ho; 

AcHN 


Ac— Arg— ASn-— Arg— Ser— Gly— Phe— Cys— Ala— Asn— Ala— Ser— NHz 

29 


NH 


[0121] ° Reagents and conditions: (a) 25 or 26, HATU, DIEA, DMSO, 50% 

for 27 (from 22), 71% for 28 (from 23); (b) 20% piperidine in DMF, 63%; (c) 0.2 M 
phosphate, 0.2 M NaCl, pH --7.4, excess MES-Na, 75%. 

10 [0122] Coupling of 27 and 28 was accomplished smoothly upon reductive 

cleavage of the disulfide linkage in 27, affording 29 as shown in Scheme 11. Aside 
from validating the methodology m a striking way, the synthesis of 29 serves to 
pinpoint the power of the total synthesis approach: Since one of its carbohydrates is 
unnatural, 29 cannot readily be obtained from natural sources or via enzymatic 

15 manipulations. 

[01231 In summary, a convergent method for the synthesis of diglycosylated 

peptides is described herein. The mechanistic rationale set forth in Scheme 8 is 
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supported by the identification of the MES-Na thioester as a reaction intennediate. 
Furthermore, potential hydrolysis and C-terniinal epimerization of the glycopeptide 
acyl donor have been suppressed. It seems likely that the method and logic set forth 
above will enjoy application in the building of complex glycopeptides of biological 

5 and even medicinal consequence. 

[0124] It will be appreciated that the method can readily be applied to the 

preparation of multi-glycosylated peptides (e.g., comprising more than two non- 
adjacent glycosides covalently linked at designated sites). 
[0125] General Reaction Procedures; 

10 [0126] Unless mentioned specifically, reaction mixtures were stirred using a 

magnetically driven stirrer bar. Reactions involving air or moisture-sensitive 
reagents or intermediates were performed under argon or nitrogen atmosphere in 
glassware which had been heat gun or flame-dried under high vacuum. An inert 
atmosphere refers to either dry argon or dry nitrogen. Reactions were monitored 

15 either by thin layer chromatography, by proton nuclear magnetic resonance (NMR) 
or by high-pressure liquid chromatography (HPLC), of a suitably worked up sample 
of the reaction mixture, 

[0127] Indicated reaction temperatures refer to those of the reaction bath, 

while room temperature (rt) is noted as 22 °C. Preparative reactions were stined 

20 magneticaUy. Tetrahydrofuran (THF), diethyl ether (Et20), methylene chloride 
(CH2CI2), and toluene were obtained from a dry solvent system (activated aliraiina 
columns, positive pressure of argon). All other solvents were used as received in 
Sure/Seal bottles (Aldrich). Triethylamine (EtsN), diisopropylethylamine (/- 
PriNEt), pyridine, and 2,6-lutidine were distilled from CaH2 immediately prior to 

25 use. All other reagents were purchased from Aldrich at the highest commercial 
quality and used without further purification. 
[0128] General Work Up Procedures; 

[0129] Unless mentioned specifically, reaction mixtures were cooled to room 

temperature or below then quenched, when necessary, with either water or a 
30 saturated aqueous solution of anmionium chloride. Desired products were extracted 
by partitioning between water and a suitable water-immiscible solvent (e.g. ethyl 
acetate, dichloromethane, diethyl ether). The desired product containing extracts 
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were washed appropriately with water followed by a saturated solution of brine. On 
occasions where the product containing extract was deemed to contain residual 
oxidants, the extract was washed with a 10% solution of sodium sulphite in saturated 
aqueous sodium bicarbonate solution, prior to the aforementioned washing 

5 procedure. On occasions where the product containing extract was deemed to 
contain residual acids, the extract was washed with saturated aqueous sodium 
bicarbonate solution, prior to the aforementioned washing procedure (except in those 
cases where the desired product itself had acidic character). On occasions where the 
product containing extract was deemed to contain residual bases, the extract was 

10 washed with 10% aqueous citric acid solution, prior to the aforementioned washing 
procedure (except in those cases where the desired product itself had basic 
character). Post washing, the desired product containing extracts were dried over 
anhydrous magnesium sulphate, and then filtered. The crude products were then 
isolated by removal of solvent(s) by rotary evaporation under reduced pressure, at an 

15 appropriate temperature (generally less than 45°C). 
[0130] General Purification Procedures; 

[0131] Unless mentioned specifically, chromatographic purification refers to 

flash column chromatography on silica, using a single solvent or mixed solvent as 
eluent. Suitably purified desired product containing elutes were combined and 

20 concentrated under reduced pressure at an appropriate temperature (generally less 
than 45°C) to constant mass. Final compounds were dissolved in 50% aqueous 
acetonitrile, filtered and transferred to vials, then freeze-dried under high vacuum 
before submission for biological testing. 
[0132] Analytical Equipment ; 

25 [0133] HPLC; All separations involved a mobile phase of 0.05% TFA (v/v) 

in water (solvent A)/0.0425% TFA in acetonitrile (solvent B). Preparative, 
semipreparative, and analytical HPLC separations were performed using a Rainin 
HXPL solvent delivery system equipped with a Rainin UV-1 detector and one of the 
following Dynamax-60A CI 8 axial compression columns 250 mm in length 

30 equipped with a similarly packed guard column: 41.4 mm diameter (prep), 21.4 m 
diameter (semiprep), or 4.6 nun diameter (analytical). Separations were performed 
at flow rates of 48 mL/min (prep), 16 mL/min (semiprep), or 1 mL/min (analytical), 
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and were monitored at a wavelength between 214 and 230 nm, depending on column 
loading. LCMS chromatographic separations were performed using a Waters 2695 
Separations Module and a Waters 996 Photodiode Array Detector equipped with a 
Varian Microsorb C18 2 x 150 mm column at a flow rate of 0.2 mL/min. 
5 101341 ESMS and LCMS: Electrospray mass spectroscopy and LCMS 

analyses were obtained on a Waters Micromass ZQ mass spectrometer in 
conjunction with the Waters HPLC apparatus described above. 
[0135] NMR: and '^C NMR spectra were recorded on Bruker 

mstruments in CDCI3, CD3OD or D2O at 400 or 500 MHz for "H and 100 or 125 
10 MHz for ^^C. , 

[0136] Reagents: All commercial materials were used as received unless 

otherwise noted. The following solvents were obtained from a dry solvent system 
and used without further purification: THF, diethyl ether, toluene, and DCM. 
Reagents were obtained from Aldrich or as noted, with the following exceptions: 
15 amino acids and resins for solid phase peptide synthesis were purchased from 
NovaBiochem; Biosynthesis grade DMF from EM Science; and all other solvents 
from Fisher Scientific (HPLC grade). 

^OH [0137] To a stirred, biphasic solution of 2-mercaptophenol 

^s-^Y^ (1.0 g, 7.9 mmol) in H2O (5,1 mL) was added, drop wise, a solution 
"° 20 of iodine (1.0 g 4.0 mmol) in methanol (3.5 mL). When the brown 
iodine color persisted the solution was diluted with ethyl acetate and water. The 
aqueous layer was removed and extracted with an additional portion of ethyl acetate. 
The combined organic layers were dried and washed with brine then dried (Na2S04) 
and concentrated to give a brown oil which was used without purification (1.5 g). 
25 The product still contains iodine. *H NMR (CDCI3, 500 MHz): 5 7.33-7.37 (m, 2H), 
7.22-7.24 (m, 2H), 6.99-7.01 (m, 2H), 6.82-6.85 (m, 2H). 6.22 (brs, 2H). ^H NMR 
(CDCI3. 400 MHz): 8 7.35 (dt, J= 8.16, 1.65 Hz, 2H), 7.22 (dd, 7.52, 1.65 Hz, 
2H), 7.00 (dd, J- 8.16, 1.15 Hz, 2H), 6.83 (dt, J= 7.52, 1.15 Hz, 2H), 6.22 (s, 2H) 
^^C NMR (CDCI3, 125 MHz): 8 157.3, 136.6, 133.6, 121.4, 120.3, 116.1. ESI-MS: 
30 Calcd. for dzHioOzSa [M+NH^]^ 267.8 Found: 267.8. 

[0138] To a stirred solution of the disulfide (1.5 g crude, -4 

.^.s^ mmol) in CH2CI2 (25 mL) was added ethyldisulfide (10.5 mL, 79.9 
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mmol) and then BFs-OEtz (10.1 mL, 79.9 mmol). The reaction was stirred at room 
temperature for three hours and then carefully quenched by the addition of (aq.) 
NaHCOs. The organic layer was drained and the aqueous layer was extracted with 
an additional portion of CH2CI2. The combined organic layers were dried (MgS04) 
and concentrated to give a yellow oil. Purification by silica gel chromatography 
(20% ethyl acetate in hexane) gave the desired product as a clear, slightiy yellow oil 
(1.45 g, 99%). 'H NMR (CDCI3, 500 MHz): 8 7.48-7.51 (m, IH), 7.28-7.32 (m, 
IH), 6.99-7.01 (m, IH), 6.86-6.9 (m, IH), 6.34 (brs, IH), 2.78 (q, J= 7.4 Hz, 2H), 
1.35 (t, 7= 7.4 Hz, 3 H). '^C NMR (CDCI3, 125 MHz): 5 156.9, 135.2, 132.2, 
121.0, 116.2, 32.4, 14.1. ESI-MS: Calcd. for CgHioOSa [M+Naf 208.8 Found: 
208.8. 

[0139] To a solution of the phenol (1.45 g, 8 mmol) and 

JO Boc-Phe-OH (2.65 g, 10 mmol) in CH2CI2 (25 mL) and THF (5 

BoeHN'V°Y^ ^^^^'^ ^^^^ 

° f 15 (1.92 g, 10 mmol) and DMAP (98 mg, 0.8 mmol). The resulting 
solution was stirred at room temperature for 18 hr at which point the volatile 
materials were removed in vacuo. The resulting oil was taken up in EtOAc and 
washed with In HCl, H2O, and then brine. The organic layer was dried (Na2S04) 
and concentrated to give a slightly yellow oil. Purification by silica gel 

) chromatography (30% ethyl acetate in hexane) gave a clear, colorless oil (3.5 g, 
>99%). *H NMR (CDCI3, 400 MHz): 5 7.78-7.81 (m IH), 7.23-7.36 (m, 7H), 7.01- 
7.03. (m, IH), 5.02 (d, J = 8.16 Hz, IH), 4.86-4.91 (m, IH), 3.31 (abx. 
Jab = 13.98 Hz, Jax - 5.41 Hz, Jtx = 7.22 Hz, 2H), 2.72 (q, J« 7.33 Hz, 2H), 1.42 (s, 
9H), 1.29 (t, 7= 7.33 Hz, 3H). ESI-MS: Calcd. for C22H27NO4S2 [M+Na]^ 456.0 

5 Found: 456,0. 

[0140] To a 250 mL round-bottomed flask equipped with a 

magnetic stir bar was added the above phenylalanine derivative (3.0 

^^^l ^' ^'^ mmol) as well as a 4m solution of HCl in dioxane (86 mL, 

s 

10 SEt3o 345 mmol) and the reaction was stirred under argon at room 
temperature for 1 .5 hr. At that point the reaction was concentrated in vacuo, leaving 
a white solid. This material was triturated with ether and subsequently concentrated. 
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This process was repeated (3x) leaving the crude product as a white solid. The 
material was dissolved in 30% B, shell frozen and lyophilized to give a white 
powder (2.17 g, 94%). NMR (CD3OD, 400 MHz): 5 7.93-7.95 (m, IH), 7.52- 
7.42 (m, 7H), 7.23-7.21 (m,lH), 4.81 (dd, 8.35, 5.73 Hz, IH), 3.56 (abx, Jai, = 
5 14.48 Hz, Jox = 5.73 Hz, 7^ = 8.35 Hz, 2H), 2.85 (q, 7.34, 2H), 1 .37 (t, 7= 7.34, 
3H). '^C NMR (CDCI3, 125 MHz): 5 169.0, 149.4, 135.6, 131.9. 131.3, 131.0, 
130.7, 129.9, 129.6, 129.1, 123.9, 55.6, 38.0, 34.1, 14.9. ESI-MS: Calcd. for 
C17H19NO2S2 [M+Hf 334.1 Found: 334.2. 

^ [0141] The phenylalanine derivative 10 (15 mg, 45 uraol) 

jrV%0 and L-cysteine (6 mg, 49 ^mol) were placed into a LCMS vial 
"zN'Y I OH along with a flea-sized stirbar. In a second vial were mixed MES- 

^ ^SH 

Na (25 mg, 150 nmol) and phosphate buffered saline (PBS) (0.2M NaCl, 0.2M 
phosphate, pH = 7.5, 2 mL). The MES-Na solution was then added directly to the 
amino acids, and the reaction was monitored by LCMS. After two hours the 

15 reaction appeared to be complete and TCEP (129 mg, 450 \xmo\) was added. This 
was stirred for 1 hour and then injected directly onto the HPLC for purification. The 
desired compoimd was obtained as a white powder after lyophilization (9.4 mg, 
78%). 'H NMR (D2O): S 7.36-7.45 (m, 3H), 7.31-7.33 (m, 2H), 4.63 (dd, J= 6.61, 
4.83 Hz, IH), 4.34 (dd, J= 7.24. 7.18 Hz, IH), 3.25 (abx. Jab = 14.1 Hz, J^, = 7.18 

20 Hz, Jfa = 7.24 Hz. 2H). 2.96 (abx. Jot = 14.2 Hz, J„ = 4.83 Hz, /^c = 6.61 Hz, 2H). 
LCMS: 5-65%B over 20 min, rt = 8.53 min. HPLC: 5-95%B over 30 min. rt = 
11.45 min. ESI-MS: Calcd. for C12H16N2O3S [M+H]* 269.1, Found: 269.1 
[0142] Exemplary ligation conditions: 

[0143] The two glycopeptide halves (12, 2.2 mg, 1.44 ^imol) (18, 1.4 mg. 

25 1.44 nmol) were placed in a LCMS vial along with a flea-sized stirbar. A stock 
solution of MESNa (18.3 mg, 111 mmol) in phosphate buffered saline (0.2MNaCl, 
0.2M phosphate, pH = 7.4, 1 mL) was made and of this, 600 fiL was added to the 
glycopeptides. The reaction was monitored by LCMS and, once finished. TCEP (25 
mg, 0.087 mmol) was added and the solution stirred for 2 hr then injected directly 

30 onto the HPLC for purification. 
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[0144] LCMS: 5-65%B over 20 min, rt = 9.98 min. HPLC: 25-55%B over 

30 

5 rt = 7.65 min. ESI-MS: Calcd. for C93H138N24O37S [M+2H]^* 1108.5, Found: 
1108.6, (M+3H]'* 739.3, Found: 739.5. 

AcHN i y^K^ 


.0 


[0145] LCMS: 5-65%B over 20 rain, rt = 11.40 min. HPLC: 5-65%B over 

10 20 "li^ 
rt = 9.85 min. ESI-MS: Calcd. for C73H120N22O30S [M+2H]^* 909.4, Found: 909.5, 
P^+SH]'"^ 606.6, Found: 606.8. 


HO 

-OH 


"A ^ HO^ 


( ( 

^NH NH 


"® ACHN? °" 


ACHNj 


HN^NH, HN^NH^ OH OH 


1 5 [0146] LCMS : 5-65%B over 20 min, rt = 4.02 min. HPLC: 5-65%B over 20 

min, rt = 6.71 min. ESI-MS: Calcd. for C73H,2oN2203oS [M+2H]^* 898.4, Found: 
898.6. 
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10 


8822*5^ AcHN 


HN' HN' Hg55^ 


OH 


HO-^ OH 2^ 

(01471 LCMS: 5-65%B over 20 min, rt = 7.81 min. HPLC: 5-65%B over 20 

min, rt = 8.39 min. ESI-MS: Calcd. for C98H160N24O51S [M+2H]^* 1261.5, Found: 
1261.5, [M+SH]^* 841.4, Found: 841.5. 


HO-v OH 


HO-' OH 



HO-^ OH 


SH " I "OH 



29 hBsW 
HO-^OH 

[0148] LCMS: 5-45%B over 20 min, rt = 10.82 min. HPLC: 5-45%B over 

20 

rt = 9.31 min. ESI-MS: Calcd. for C98Hi6aN2405iS [M+2H]^^ 1504.6, Found: 
1504.9, [M+3H]^* 1003.4, Found: 1003.7. 

Abbreviations and Glossary 


[01491 

A: alanine 

[0150] 

Ac: acetyl 

[01511 

Ala: alanine 

15 [01521 

Arg: arginine 

[01531 

Asn: asparagine 

[01541 

Asp: aspartic acid 

[0155] 

Bn: benzyl 

[0156] 

Boc: ^er/-butyloxycarbonyl 
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[0157] Bu: butyl 

[0158] Bz: benzoyl 

[0159] CAN: eerie ammonium nitrate 

[0160] C-terminus: peptide carbonyl terminus 

5 [0161] Cys: cysteine 

[0162] D: aspartie aeid 

[0163] DIEA: iV^-diisopropylethylamine 

[0164] DMAP: //^-Dimethylaminopyridine 

[0165] DMF: dimethyl formamide 

10 [0166] DMSO: dimethyl sulfoxide 

[0167] DTBMP: di-^erZ-butylmethylpyridine 

[0168] DTBP: di-/err-butylpyridine 

[0169] Et: ethyl 

[0170] Fmoc: 9-fluorenylmethyloxycarbonyl 

15 [0171] G: glycine 

[0172] Gal: galactose 

[0173] Glc: glucose 

[0174] Gin: glutamine 

[0175] Glu: glutamic aeid 

20 [0176] Gly: glycine 

[0177] H: histidine 

[0178] HATU: 7-azahydroxybenzotriazolyl tetramethyluronium 

hexafluorophosphate 

[0179] His: histidine 

25 [0180] He: isoleucine 

[0181] K: lysine 

[0182] kDa: kilodaltons 

[0183] KLH: keyhole limpet hemocyanin 

[0184] L: leucine 

30 [0185] Leu: leucine 

[0186] Lys: lysine 

[0187] Man: maimose 
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[0188) 

MES-Na: 2-mercaptoethanesuiiomc acia, soaium saii 


[0189] 

MHC: major histocompatibility complex 


[0190] 

N: asparagine 


[0191] 

NAc: A^acetyl 

5 

[0192] 

NCL: native chemical ligation 


[0193] 

N-temiinus: peptide amine tenninus 


[0194] 

0-lmked: Imked through an ethereal oxygen 


[0195] 

PamSCys: tripalmitoyl-S-glycerylcysteinylserine 


[0196] 

PBS: phosphate-buffered saline 

10 

[0197] 

Ph: phenyl 


[0198] 

PMB: p-methoxybenzyl 


[0199] 

Pro: proline 


[0200] 

GP120: prostate specific antigen 


[0201] 

Py: pyridine 

15 

[0202] 

QS21: a glycosteroidal immunoadjuvant 


[0203] 

R: arginine 


[0204] 

S: serine 


[0205] 

sat. aq.: saturated aqueous 


[0206] 

Ser: serine 

20 

[0207] 

T: threonine 


[0208] 

TBAF: : tetra-«-butylammonium fluoride 


[0209] 

- - - 4 1*11 

TBS: /er/-butyldimethylsilyl 


[0210] 

/Bu: tert-hutyl 


[0211] 

Tf: trifluoromethanesulfonate 

25 

[0212] 

TFA: Trifluoroacetic acid 


[0213] 

THF: tetrahydrofuran 


[0214] 

Thr: threonine 


[0215] 

t-GP120: total prostate specific antigen 


[0216] 

Trp: tryptophan 

30 

[0217] 

V: valine 


[0218] 

Val: valine 


[0219] 

W: tryptophan 
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